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Abstract 

We consider the problem of partitioning n integers into two subsets of given cardinalities 
such that the discrepancy, the absolute value of the difference of their sums, is minimized. 
The integers are i.i.d. random variables chosen uniformly from the set {1,...,M}. We 
study how the typical behavior of the optimal partition depends on n, M and the bias s, 
the difference between the cardinalities of the two subsets in the partition. In particular, 
we rigorously establish this typical behavior as a function of the two parameters k := 
n~^log2M and h := \s\/n by proving the existence of three distinct "phases" in the nb- 
plane, characterized by the value of the discrepancy and the number of optimal solutions: a 
"perfect phase" with exponentially many optimal solutions with discrepancy or 1 ; a "hard 
phase" with minimal discrepancy of order Me"®*-"-'; and a "sorted phase" with an unique 
optimal partition of order Mn, obtained by putting the (s + n)/2 smallest integers in one 
subset. Our phase diagram covers all but a relatively small region in the «;6-plane. We 
also show that the three phases can be alternatively characterized by the number of basis 
solutions of the associated linear programming problem, and by the fraction of these basis 
solutions whose ±l-valued components form optimal integer partitions of the subproblem 
with the corresponding weights. We show in particular that this fraction is one in the 
sorted phase, and exponentially small in both the perfect and hard phases, and strictly 
exponentially smaller in the hard phase than in the perfect phase. Open problems are 
discussed, and numerical experiments are presented. 
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1 Introduction 

Phase transitions in random combinatorial problems liave been tlie subject of mucli recent 
attention. The random optimum partitioning problem is the only NP-hard problem for 
which the existence of a sharp phase transition has been rigorously established, as have 
many detailed properties of the transition ([3], see Q for a short overview). Here we study 
a constrained version of the random optimum partitioning problem, and extend some of 
the results of to that case. 

The integer optimum partitioning problem is a classic problem of combinatorial op- 
timization in which a given set of n integers is partitioned into two subsets in order to 
minimize the absolute value of the difference between the sum of the integers in the two 
subsets, the so-called discrepancy. Notice that for any given set of integers, the discrep- 
ancies of all partitions have the same parity, namely that of the sum of the n integers. 
We call a partition perfect if its discrepancy is 0, when this sum is even, or 1, when this 
sum is odd. The decision question is whether there exists a perfect partition. In the 
uniformly random version, an instance is a given a set of n i.i.d. integers drawn uniformly 
at random from {1,2,..., M}. We will sometimes use the notation m = log2 M; notice 
that each of the random integers has m binary bits. Previous work had established a 
sharp transition as a function of the parameter k := m/n, characterized by a dramatic 
change in the probability of a perfect partition. For M and n tending to infinity in the 
limiting ratio k, = m/n, the probability of a perfect partition tends to for k < 1, while 
the probability tends to 1 for k > 1. This result was suggested by the work of one of the 
authors [Hj and proved in a paper by the three other authors [S]. See [12] for a beautiful 
introduction to the optimum partitioning phase transition and some of its properties. 

Here we consider a constrained variant of the problem in which we require that the two 
subsets have given cardinalities; we say that the difference of the two cardinalities is the 
bias, s, of the partition. We establish the phase diagram of the random constrained integer 
partitioning problem as a function of the two parameters k, := m/n and b := \s\/n. In 
the language of statistical physics, b would be called the magnetization, and the problem 
considered here, where b is constrained to assume a particular value, would be called the 
"microcanonical" integer partitioning problem. Microcanonical problems are known to 
be much more difficult than their unconstrained analogues, particularly in the case of 
random systems. 

Let us first review previous rigorous and nonrigorous work on the random optimum 
partitioning problem. A good deal of rigorous work has been done for the unconstrained 
random partitioning problem with random numbers drawn from a compact interval in M, 
which can be interpreted informally as the limiting case of m ^ n. Karmarkar and Karp 
gave a linear time algorithm for a suboptimal solution with a typical discrepancy of 
order at most 0{n~'^^°^"') for some constant c > 0. Confirming a conjecture by Karmakar 
and Karp, Yakir 5J proved that the expected discrepancy delivered by this algorithm is 
indeed n~^'-'°s"^ The optimum solution was studied by Karmarkar, Karp, Lueker and 
Odlyzko who proved that the typical minimum discrepancy is much smaller, with the 
median of order 9{2~"'n^^'^). More recently, Lueker ^Hj proved exponential bounds for 
the expected minimum discrepancy. Loosely speaking, these results correspond to m far 
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exceeding n, and hence k oo, thus well above the phase transition of the unconstrained 
problem which occurs for k = 1. 

There have also been (nonrigorous) studies of optimum partitioning in the theoretical 
physics and artificial intelligence communities, where the possibility of a phase transition 
was examined. Fu [Hj noted that the minimum discrepancy is analogous to the ground 
state energy of an infinite-range, random antiferromagnetic spin model, but concluded 
incorrectly that the model did not have a phase transition. Gent and Walsh ^HI studied 
the problem numerically and introduced the parameter k = m/n. They noticed that the 
number of perfect partitions falls off dramatically at a transition point estimated to be 
close to K = 0.96. Ferreira and Fontanari studied the random spin model of Fu, and 
used statistical mechanical methods to get estimates of the optimum partition [7] and to 
evaluate the average performance of simple heuristics 0. Ferreira and Fontanari [7] also 
considered the constrained optimum partitioning problem, noted that the constrained 
problem is analogous to putting the random antiferromagnet in an external field, and 
observed that the problem becomes much easier when the bias parameter b satisfies b > 
V2-1. 

Returning to the unconstrained problem, one of the authors of this paper used sta- 
tistical mechanical methods and the parameterization of Gent and Walsh to derive a 
compelling, but nonrigorous argument for a phase transition at k = 1, and also derived 
many of the properties of the transition JJj. In a later work, this author analyzed 
Fu's model by using a heuristic approximation known in statistical mechanics as Derrida's 
random energy model [H], and obtained the limiting distribution of the k-th. smallest dis- 
crepancy. 

Motivated by the statistical mechanics analysis in ^Tj and ^H] , the other three authors 
of this paper undertook an extensive rigorous study of the random integer partitioning 
problem |3j. They established the existence of a transition at = 1 below which the 
probability of a perfect partition tends to one with n and m, and above which it tends 
to zero, and also gave the finite-size scaling window of the transition: namely, in terms 
of the more detailed parametrization m = k„ with = 1 — log2n/{2n) + Xn/n, the 
probability of a perfect partition tends to 1, 0, or a computable A-dependent constant 
strictly between and 1, depending on whether A„ tends to — oo, oo, or A G (—00,00), 
respectively. The work also calculated the distribution of the number of perfect partitions, 
the distribution of the minimum discrepancy, and the joint distribution of the k smallest 
discrepancies, which give the entropy, the ground state energy and the bottom of the 
energy spectrum, respectively. In particular, the paper [2j provided a rigorous justification 
of the Derrida-type approximation both inside and above the scaling window, insofar as 
the joint distribution of k (finite) smallest discrepancies is concerned. 

The location of the phase transition for the unconstrained problem immediately yields 
a one-dimensional phase diagram as a function of k: For k, G (0, Kc) with = 1, the 
system is in a "perfect phase" in which the probability of a perfect partition tends to 1 
as M and n tend to infinity in the fixed function k. For n G {nc, 00), the probability of a 
perfect partition tends to 0, and moreover, there is an unique optimal partition. We call 
this the "hard phase," since for k > Kc, it is presumably computationally difficult to find 
the optimal partition. 
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In this work, we consider the constrained optimum partitioning problem with bias s 
and extend the phase diagram to the two-dimensional nb-plane. See Figure^ In addition 
to the extensions of the perfect and hard phases, we establish the existence of a new phase 
which we call the "sorted phase." 



bo.3 




Figure 1: Phase diagram of the constrained integer partitioning problem. 



The sorted phase is easy to understand. One way to meet the bias constraint is to 
take the {s + n)/2 smallest integers and put them in one subset of the partition.^ It is 
not difficult to see that the resulting "sorted partition" is optimal if the total weight of 
this subset is at least half of the sum of all n integers. We define the sorted phase as the 
subset of the /t6-plane where the sorted partition is optimal. We prove that the sorted 
phase is given by the condition 

b>b,:=V2-l, (1.1) 

see region III in Figure ^ Moreover, we show that the minimal discrepancy in this phase 
is of the order Mn. The region b > — 1 is precisely where Ferreira and Fontanari |7j 
observed that the corresponding statistical mechanical problem becomes "self- averaging." 

Our analysis of the perfect and hard phases for 6 < 6c is much more difficult. In 
this region, we use integral representations for the number of partitions with a given 

^Note that the task of finding this partition is even easier than the task of sorting the n integers, 
which would take, on average, 9{n\ogn) comparisons. Instead, the (s + n)/2 smahest integers can be 
found in strictly linear time in n. 
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discrepancy and bias; these representations generalize tliose used in The asymptotic 
analysis of the resulting two-dimensional random integrals leads to saddle point equations 
for a saddle point described in terms two real parameters t] and (. For discrepancies of 
order o(M) (including, in particular, the case of perfect partitions), the saddle point 
equations determining ( and t] are: 

1 

J X tanh((^x + r]) dx = 0, 

(1.2) 

J tanh{(x + 1]) dx = —b. 



The solution (C, rj) of these equations can be used to define the two convex curves in 
Figure m To this end, let^ 

1 

L(C,,):=6, + /log(2coBh(C. + „))d. (1.3) 



p(C, ,):=!- t^"MC + ^^)^-tanh(^)_ ^^ ^^ 

For {(, rj) a solution of (jl.2|) . we then define 



K_(6) : = -log2p(C,^?), 
«:e(6): = ^L(C,r/). 



1.5) 



From bottom to top, the two convex curves joining (0, be) and (1, 0) in Figure Q are then 
given by K = H-{b) and k = Kc{b). 

In this paper, we prove that, in the region k < K,-{b), with probability tending to 
one as n tends to infinity (or, more succinctly, with high probability, w.h.p.) there exist 
perfect partitions; see region I in Figure ^ Moreover the number of perfect partitions 
is about 2^'^'="'^)"' in this "perfect phase." We also prove that w.h.p. there are no perfect 
partitions in the region b < be and k, > Hc{b). As in the unconstrained problem, we call 
this the "hard phase." Our results leave open the question of what happens in the narrow 
region Kc, and also whether the optimal partition is unique in the hard phase; 

see the final section for a discussion of this and other open questions. 

We are also able to prove that these phase transitions correspond to qualitative changes 
in the solution space of the associated linear programming problem (LPP). In the actual 
optimum partitioning problem, each integer is put in one subset or the other. The relaxed 
version is defined by allowing any fraction of each integer to be put in either of the two 
partitions. Using our theorems on the typical behavior of integer partitioning problem and 

^It turns out the solutions of the saddle point equations 11.21 are just the stationary points of the 
function i(C, '7) 
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some general properties of the LPP, we show the following. In the sorted phase, i.e. for 
b > be = V^— 1, w.h.p. the LPP has a unique solution given by the sorted partition itself. 
For b < be, i.e. in the perfect and hard phases, w.h.p. the relaxed minimum discrepancy 
is zero, and the total number of optimal basis solutions is exponentially large, of order 
2fcc(6)".+Op(ni/2)^ Finally, in the perfect and hard phases, we consider the fraction of these 
basis solutions whose integer-valued components form an optimal integer partition of the 
subproblem with the corresponding subset of the weights. We show that this fraction is 
exponentially small. Moreover, except for the crescent-shaped region between k = H-{b) 
and K = K,c{b), we show that the fraction is strictly exponentially smaller in the hard phase 
than in the perfect phase. This fraction thus represents some measure of the algorithmic 
difficulty of the problem, see Remark |8.1[ 

The outline of the paper is as follows. In the next section, we define the problem 
in detail, and precisely state our main results. In Section IHl we introduce our integral 
representation and show how it leads to the relevant saddle point equations. We also give 
a brief heuristic derivation of some of the phase boundaries. Section 0] contains a proof 
of existence and properties of the solution of the saddle point equations for b < be- In 
Section we establish an asymptotic formula for the number of partitions with given 
discrepancy and bias in the perfect phase. As a corollary, we obtain both the existence of 
exponentially many perfect partitions for k < and a theorem on the distribution 

of the bias in the unconstrained problem for n < 1. The analysis of the hard phase 
is done in Section IHl In Section [71 we prove that the sorted partitions are optimal for 
b > be- We also show why the sorted phase boundary coincides with the boundary for 
existence of solutions of the saddle point equations. In Section|Hl we formulate the relaxed 
version of the optimum partitioning problem, and establish our results on the space of 
basis solutions of the LPP. Finally, in Section IHl we discuss open problems and a few 
numerical experiments addressing some of these problems. 



2 Statement of Main Results 

Let Xi, . . . , Xn be n independent copies of a generic random variable which is distributed 
uniformly on {1, ... , M}. We are interested in the case when M grows exponentially with 
n, and define the exponential rate, i.e. 

K=-\ogoM. (2.1) 

n 

To avoid trivial counterexamples, we will always assume that k stay bounded away from 
both and oo as n — oo. We will use P and E, with or without sub index n, to denote 
the probability measure and the expectation induced by X = (Xi, . . . , X„). 

A partition of integers into two disjoint subsets is coded by an n-long binary sequence 
cr = (cTi, . . . , cr„), aj G {—1,1}; so the subsets are {j : aj = 1} and {j : aj = — 1}. 
Obviously a and —a are the codes of the same partition. Given a partition cr, we define 
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its discrepancy (or energy), d(X.,cr), and bias (or magnetization), s(cr), as 

n 

d{X,(T) =\(T ■ X|, with a ■ X = ^ajXj, (2.2) 

.(^) = a ■ e = |{j : a,- = 1}| - \{j : a, = -1}|. (2.3) 

Here e is the vector (!,...,!). Clearly s(cr) is an integer in {— n, ...,n}, so let s G 
{—n, . . . , n} and define the bias density 

b = ^ (2.4) 

n 

so that b G [0, 1]. Note that by symmetry it suffices to consider s(cr) g {0, . . . , n}, so we 
will often take a non-negative integer s G {0, . . . , n}, in which case s = bn. We define an 
optimum partition as a partition cr that minimizes the discrepancy d(X., cr) among all the 
partitions with bias equal to s, and a perfect partition as a partition cr with |(i(X, cr)| < 1. 

Theorems El O and O below describe our main results on the phases labelled I, II, 
and III in Figure 1 in the introduction. In the statement of these theorems we will use 
the parameters (,ri,Kc{b) and K-(6) defined in p.2|l - Before getting to principal 

results, we must begin with an existence statement for the parameters (,ri. 

Theorem A Let b < b^, where be = — 1. Then the saddle point equations ()1.2|) have 
a unique solution (Crj) = {({b),ri{b)) . 

This theorem is proved in Section 0] 
Let 

Z„(£,s) = Z„(£,s;X) (2.5) 

denote the random number of partitions cr with cr • X = £ and cr ■ e = s. Since s{cr) has 
the same parity as n, and d(K, cr) has the same parity as Yl]j=i -^ji ^^^^ only consider 
values of s which have the same parity as n, and values of C. which have the same parity 
as 'YTj=i ^j- theorems in this section and in much of the rest of the paper, we will 

not state these restrictions explicitly. 

Our central goal is to use the saddle point solution in order to bound the Zn{i, s) for 
various given values of i and s. To formulate our results in a compact, yet unambiguous 
form, we use a shorthand an < a (a„ > a, resp.) instead of limsup an < a (liminf a„ > a, 
resp.), even when the n-dependence of a„ is only implicit, as in = ?T,~^log2M and 
b = \s/n\. We will also use the notation = Op{gn) and /„ = Op{hn) if fn/9n is bounded 
in probability and fn/hn goes to zero in probability, respectively. Also, as is customary, 
we will say that an event happens with high probability (w.h.p.) if the probability of this 
event approaches 1 as n ^ oo. In all our statements n, M, s and £ will be integers with 
n > 1, M > 1 and s > 0. Our main results in the perfect phase are summarized in the 
next theorem and remark. 

Theorem B Let i = o{Mn^/'^),b < be and k < /t_(6). Then w.h.p. Zn{i,s) > 1 and 

Zn{i, s) = 2[«-W— l"e^""''"+°("'^'\ (2.6) 
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where Sn converges in probability to a Gaussian with mean zero and variance cr^ = 
Var(log(2 cosh(^f/ + 77))), with U uniformly distributed on [0,1]. Consequently, w.h.p., 
there exist exponentially many perfect partitions, with i = if^jXj is even, and \£\ = 1 
if Xj is odd. 

Remark 2.1 Under the conditions of Theorem\^ we actually prove a much more accu- 
rate estimate. Namely, we show that there are 2x2 positive definite matrices R and K 
with deterministic entries, and a constant q <1 such that, with probability 1 — 0(g'°s 

.) = exp + r^. + ^ log(2 cosh(CX,/M + r/)) j ^^^^J___-^ (l + o(l)). 

(2.7) 

Here Tn is a two-dimensional random vector which converges in probability to a Gaussian 
vector T with zero mean and covariance matrix K. See Theorem \5.1\ in Section \5. 1\ 

We also prove a corollary relating the distribution of the bias in the unconstrained 
problem to the distribution of the bias between heads and tails in fair coin flips; see Sub- 
section [5~R 

The proof of Theorem ^ and Remark 12.11 is given in Section El 

Note that the above expression for Zn{i, s) is much more complicated than its analogue 
in the unconstrained case, see equation (2.6) in [Hj. Both the sum in the first exponent 
and the entire second exponent represent fluctuations which were not present in the 
unconstrained case, and which make the analysis of the perfect phase much more difficult 
here; see also Remark 12.41 below. 

Our next theorem, which describes our main results on the hard phase, has two parts: 
The first shows that there are no perfect partitions above k = Kc{b), and the second gives 
a bound on the number of optimum partition for k > k_. To state the theorem, let 
dopt = dopt{n; s) denote the discrepancy of the optimal partition, and let Zgpt = Zopt{n; s) 
denote the number of optimal partitions. 

Theorem C Let b < be. 

a) If n > Kc{b), then there exists a 5 > such that with probability 1 — 0{e~^^°^^ 
there are no perfect partitions, and moreover 

dopt > 2"[''-''^(^)l~^''("'^'). (2.8) 

b) If K > K-{b) and e > 0, then there exists a constant 6 > such that 

dopt < 2"["-"-('')+"l, (2.9) 

and 

^pt < 2"t''^(^)"''-(^)+"l, (2.10) 
both with probability 1 — 0(6"*^'°^^"). 
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This theorem is proved in Section IHl However, perhaps somewhat surprisingly, the proof 
of the upper bound in ()2.9|) runs parallel to the proof of Theorem ^ that established 
existence of perfect partitions for /t < 

Remark 2.2 We believe that the bound in ()2.8p is actually sharp. If we assume that this 
is the case, in fact, even if we assume that the weaker bound 

4pt = 2"('^-'^=+°*'(^)) (2.11) 

holds w.h.p. whenever k, > Kc, then we can significantly improve the upper bound fl2.10|) . 
Indeed, under the assumption ()2.11|) . Zopt grows subexponentially with n whenever k, > 
Kc{b), see Remark \(j.l\ (Hi). 

The optimum partition problem is much simpler for b > be- Our main result on the 
sorted phase is the following theorem, which is proved in Section [3 

Theorem D Letb > b^. Then w.h.p. the optimal partition is uniquely obtained by putting 
{s+n)/2 smallest integers Xj in one part, and the remaining {n—s)/2 integers into another 
part. W.h.p., dopt is asymptotic to ^ [(1 + 6)^ " 2] , i.e., of order Mn. 

By this theorem, for b sufficiently large, the partition is determined by the decreasing 
order of weights Xj, but not by the actual values of Xj. 

Until now, all our statements have been about the likely properties of the optimum 
partition cr, whose components are allowed to assume only two values, —1 and +1. In 
an interesting twist, these results shed light on the likely properties of the related linear 
programming problem (LPP): find the minimum value of d subject to linear constraints 

-d < E"=i ^i^i) E"=i (^j^j < d, 

En 
j=l = S> 

-1 < (Tj < 1, (l<j<n). 

We denote this minimum value of d by d^^^. Our last theorem indicates that the LPP 
inherits the phase diagram of the optimum partition problem, and moreover provides, 
however incomplete, some way to rate the three regions according the algorithmic difficulty 
of the optimal partition problem. To make this precise, we define Fn{K,, b) to be the fraction 
of basis solutions cr with the property that the ±l-valued components cXj form an optimal 
partition of the corresponding subproblem with weights Xj. Henceforth, we will call this 
the "optimal subpartition property." 

Theorem E a) If b > be, then w.h.p. the sorted partition is a unique solution of the 
LPP, and thus d^^/ = Q{Mn) and Fn{K,b) = 1. 

Let b < be- 

b) Then w.h.p. d^^^ = 0. In addition, w.h.p. there are 2['^=('')+°(^M" basis solutions, 
each having either none or exactly two components a-i ^ ±1. 



Constrained Integer Partitions (DRAFT, February 24, 2003) 



9 



c) W.h.p. Fn{K,h) = 2-['^+°(i)l"/orK < an(i 2-['^-(^)+°(^M" < < 2-['^-('')+°(i)] 

for K, > K_{b). 

Remark 2.3 (i) If one assume that the number of optimal partitions Zopt in the hard 
phase grows subexponentially with probability at least 1 — o(?2~^) (see Remark for a 
motivation of this assumption) , our upper bound on the fraction Fn{K,, b) in the hard phase 
can be improved to match the lower bound, yielding Fn{K,b) = 2~"['^<:('')+"(i)] in the hard 
phase, see Remark \8.1\ (i). 

(a) If, on the other hand, the asymptotics of Theorem \5. 1\ hold up to Kc, more precisely, 
if one assumes that for b < be and k < K^b) 

s) = 2"[''^('')-''+"W)l (2.12) 

holds with probability least 1 — o(n"^), then a bound of the form Fn{K,, b) = 2~'^['^+°(-'^)] can 
be extended to all k < k^, see Remark \8.1\ (ii). 

We close this section with a few additional remarks and an additional theorem on the 
expected number of perfect partitions. We start with a discussion of Theorem iBl 

Remark 2.4 While ()2.(i|l has only been proved for k, < K,-{b), an upper bound of the same 
form can be shown to hold for all k, see Theorem \6.1\ Due to the random fluctuations 
of the Gaussian term, this upper bound is of order e~®("^''^) with positive probability as 
soon as K > Kc{b) — 0{n^^^'^), so that in this regime, there are no perfect partitions 
with probability bounded away from zero. Note, however, that as b —>■ 0, the variance 
= Var(log(2 cosh(C[/ + 7/))) of the Gaussian term tends to zero. 

We expect that for all b e (0, be) fluctuations of this kind persist around the true 
threshold, whether it is actually equal to t^cip), or whether it is some other value kcib) < 
K,c{b). For the constrained partition problem with b G (0, be), we therefore expect a scaling 
window of width at least n'^/"^ in which the probability of perfect partitions lies strictly 
between and 1. This is to be contrasted with the unconstrained case, where we had a 
very narrow scaling window of width Qin"^) about the transition point k^, see 

Next let us consider the statements of Theorem O Here again the situation is much 
more complicated than in the unconstrained case. By Theorem iBl and the lower bound in 
Theorem the minimum discrepancy changes from being at most one to being expo- 
nentially large as k crosses the interval [/€_, Kc\- However, we also prove (see Section IFT^ 
that the expected number of perfect partitions remains exponentially large until k reaches 
a value strictly exceeding Hc- This is the content of the following theorem and remark. 

Theorem F Let i e {-1,0,1} andbe (0,1). Then 

lim \n-HogE{Zn{i,s)) - R{K,b)] =0 (2.13) 

n— >oo 

where 

R{K,b) = iJ((l + 6)/2) + A6 + log(A~^sinhA)-Klog2, (2.14) 
with II{u) = u\og{l/u) + (1 — m) log(l/(l — u)), and X satisfying cothA = A^"*^ — b. 
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Figure 2: The curves k = Kc(h) , k = Kg (6) and b = be 



Remark 2.5 Graphing the curve R{n,b) = 0, i.e. 

H{{1 + b)/2) + Xb + log(A~i sinh A) 



Ke{b) :-- 



log 2 



(2.15) 



0, 



we see that it lies strictly above k = Kc{b), except at the only common point k = l,b 
see Fig. In particular, the curve intersects the b-axis at b = 0.56 ■ ■ ■ > be = 0.41 .... 
Thus for the points [n, b) between the curves k = Hc{b) and n = He{b), the expected number 
of perfect partitions grows exponentially, while w.h.p. there are no perfect partitions at all. 
This complex behavior did not manifest itself in the unconstrained optimum partitioning 
problem f^. 

We close this section with a discussion of the results of Theorem lEl 

Remark 2.6 Clearly, the fraction [Fn{K,,b)]^^ is the expected number of times one has 
to generate a uniformly random basis solution of the LPP to get a basis solution with the 
optimal subpartition property. In absence of a better candidate, seems to be 

a possible measure of algorithmic difficulty of the integer partition problem. Our theorem 
says that w.h.p. this measure is lowest for the sorted phase (b > b^) where [Fn{K,, b)]'^^ = 
next lowest for the perfect phase (b < be and k < (b)) where [F„(/t,6)]-i = 2'""+°("), and 
indeed hardest in the hard phase (b < be and k > Ke{b) ) where [Fn{n,, ^)Y^ ^ 2('^-(^)+°(^))". 

If we make the additional assumptions of Remark \2.,% we have that [Fn{K,,b)]~^ = 
2Kn+o(n) b < be and k < Ke{b), while b)]^^ = 2'^'="+°(") for k > Ke. This gives an 

easy- hard- easy picture along any curve b = k, > 0, that crosses the lines k, = He{b) 
and b = be, provided that b{K) is strictly increasing in k. Indeed, at a point {k, b) on such 
a curve [Fn]~^ grows exponentially in uk, as long as n < t^eib^n)), i.e. until the curve 
b = intersects k = Keib). For larger values of n, [Fn]~^ grows at the exponential rate 
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nK,c{b{n)) < fiK. Once the curve b = h{K) crosses the horizontal line b = be, the problem 
becomes even easier with [Fn]^^ = 1. In fact, above this line, the problem becomes easy 
in the usual sense, since the rounded and the fractional problem are identical, and sorted 
partitions can be found in linear time. 

3 Preliminaries and Outline of Proof Strategy 

In this section, we define our notation, review tlie heuristics of the proof, and point out 
why naive extensions of the unconstrained analysis of [3] fail in the constrained case. 

3.1 Sorted Partitions 

We first discuss our strategy to prove that in region III, the optimal partition is sorted 
and has discrepancy of order Mn. To this end, we consider n weights Xi, . . . , Xn, chosen 
uniformly at random from {1, . . . , M}, and reorder them in such a way that their sizes are 
increasing, X^i) < X^2) < ■ ■ ■ < -^7r(n)7 where 7r(l), . . . , 7r(n) is a suitable permutation 
of 1, . . . ,n. Since M is assumed to grow exponentially with n, we have, in particular, 
ra^ = o(M), which implies that w.h.p. no two weights are equal. So w.h.p. the permutation 
vr is unique and X^(i) < Xt,(^2) < ■ ■ ■ < -^7r(n)- 

Given a bias s > 0, (with s = n(mod 2)), we need to find an optimum partition that 
puts k = {s + n)/2 integers in one part, and the remaining n — k integers into another 
part. One such feasible partition is obtained if we select the k smallest integers for the 
first part; we call it the sorted partition. It is coded by the cr, with ^^^(i) = I for i < k 
and (T7r(i) = —1 for i > k. If the total weight of (n — k) largest weights is, at most, the 
total weight of k smallest weights, then it is intuitively clear that the sorted partition is 
optimal. More precisely: if 

k n 

j=i j=k+i 

then the sorted partition is the unique, optimal partition, and the minimal discrepancy 
is 

dopt = Ss{X). (3.2) 

See Section for a formal proof. 

To determine the phase boundary of the phase III, we thus have to determine the 
region of the phase diagram in which w.h.p. the sorted partition meets the condition 
()3.1|) . Leaving the probabilistic technicalities out of our heuristic discussion, let us replace 
the condition (|3.1|) by its mean version, namely E((55(X)) > 0. Consider an arbitrary 
b G (0, 1]. Let Xo = (1 + b)/2 and Mq = [xqMJ . For a typical set of weights Xi, . . . , X„, 
let us consider the sorted partition with aj = 1 for Xj < Mq, and aj = —1 for Xj > Mq. 
Since the probability that Xj < Mq is equal to Xq = Mq/M = xq + 0(M~^), we get that 

•^If ^s(^) — —1, the sorted partition is still optimal (it is, in fact, perfect). But in general, it is not 
the unique optimum partition. 
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the expected number of weights Xj with Xj < Mq is nxo, implying that the expected bias 
is 2nxo — n = nb + 0{n/M). The expected discrepancy can be calculated in a similar 
manner, giving the expression 



E 



J 
n 
M 



Mod + Mo) - 



M(l + M) 



(3.3) 



X. 







O(M-i) 



Mn 



b+1 



- + 0{M~') 



Mn. 



So, E((5s(X)) is large positive, of order Mn, iff (6 + 1)^/4 — 1/2 > 0, or equivalently 
b > be = V2 — 1. In Section 16.31 we prove the condition 6 > 6c is both necessary 
and sufficient for 5s(X) to be, w.h.p., positive, of order Mn. In language of statistical 
mechanics, we show that, for b > be, 5s(X) is "self- averaging," i.e. its distribution is 
sharply concentrated around E(55(X)). 

Remark 3.1 On the heuristic level presented here, the above arguments can easily be gen- 
eralized to an arbitrary distribution for the weights Xi, . . . , X„, as long as these weights are 
independent copies of a generic (discrete) variable X with a reasonably well behaved prob- 
ability distribution. Assuming, e.g., that the variable X/M has a limiting distribution with 
density fi, one obtains that the critical value ofb is given by be = bdfi) = 2 Jq^° fj,{x) dx — 1, 
where Xq is determined by the equation Jq° x^{x) dx = J)^ x^{x) dx. However, we have 
not tried to extend all our results to this more general ^-density case. 



3.2 Integral Representations 

Let us now turn to the much more difficult region b < be- Without loss of generality, we 
may take s > 0, so that b = s/n. 

Let Zn{i, s) = Zn{i, s] X) denote the total number of partitions cr such that <t • X = £ 
and a e = s. Guided by the results of 0, one might hope to prove that, as the parameter 
K = n~^ log2 M is varied, the model undergoes a phase transition between a region with 
exponentially many perfect partitions and a region with no perfect partitions. Since 
perfect partitions correspond to £ = or £ = ±1, we will be mainly interested in Zn{i, s) 
for \i\ < 1, while s will typically be chosen proportional to n. 

A starting point in ^ was an integral (Fourier-inversion) type formula for Zn{i) = 
Zn{i] X), the total number of cr's such that cr ■ X = namely 

2" r " 

Z^{i) = — / cos{ix)Y[cos{xXj)dx. (3.4) 

x-e(-7r/2,7r/2] ^^'^ 

We need to derive a two-dimensional counterpart of that formula for Zn{i,s). To this 
end, let us first recall that by ()2.3|1 . s = 2\{j : aj = 1}\ — n, so that a generic value s 
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of s(cr) must meet the condition n + s = O(mod 2). In a similar way, we get tliat cr ■ X 
has the same parity as the sum Keeping this in mind, we have that on the event 

{Ej = ^(mod 2)}, forn + s = O(mod 2), 

I(cr ■ X = £, o- ■ e = s) = ^ J J e'(-x-^)- ^^^y^ (35) 

a;,3/e(-7r/2,7r/2] 

thus extending (4.6) in [2]. Multiplying both sides of the identity by 2", and summing 
over all cr, we obtain 

on r r 

Z^{i, s) = — / / e-^(^^+^^) Y[ cos{xXj + y) dxdy. 

a;,yG(-7r/2,7r/2] (3.6) 



2"Pi/2(cr.X 



rr ■ e = s\X 



where cr = (cri, . . . , (T„) is a sequence of i.i.d. Bernoulli random variables with probabihty 
of (jj = ±1 equal to 1/2. 

We would hke to estimate the asymptotics of the integral in (j3.6p . which is equivalent 
to proving a local limit theorem for the conditional probability in ()3.6p . In general, 
to compute - via local limit theorems - the probability that some random variable A 
takes the value a, it must be the case that the corresponding expectation of A is near 
a. Thus the analogue of the representation ()3.6|1 for the unconstrained problem was well 
adapted to the analysis of perfect partitions. Indeed, in that case, we wanted to estimate 
lPi/2(|cr • X| < 1|X), and we had Ei/2(cr ■ X|X) = 0. However, in the constrained case, 
this strategy cannot be expected to work for 6 > 0, since s = hn is very far from the 
expectation of cr ■ e, namely Ei/2(c ■ e|X) = 0. 

To resolve this substantial difficulty, we introduce a two-parameter family of distri- 
butions for cTj as follows: Given r/ G M, let cr = (cti, . . . , o"„) be a sequence of random 
variables such that, conditioned on X, ai, . . . , (j„ are mutually independent, and 

P(a, = 1|X) = P{iX, + 77), P(a, = -1|X) = 1 - P{iX, + 77), (3.7) 

where 

^ ^ 2 cosh u ^ ^ 

In terms of these random variables, Z„(£, s) can be rewritten as 

Z„(£, s) = e"^"(«'^'^) P(cr ■ X = o- ■ e = s|X) 

,nL„(«,r;;X) J_ jj e-'^^''+'y^E{exp{i{x(T ■ X + j/o" ■ e)) |X) dxdy, 



e 

x,ye{-TT/2,n/2] 



(3.9) 



where 

L„(e, r^; X) := ^ + ^ + - V log(2 cosh(eX,- + r/)). (3.10) 

T) rj T) t * 



n n n 
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Indeed, fix ^,77 G M. Then Zn{i, s) can be rewritten as 

Zn{i,s)= J2 I('r-X = £, r-e = s) 

Te{-i,+i}" 



n 



:-r-X=£, T:-rX=f, j = l 

-re — s T-e—s 

n -in 



t:tX=«, j' = l 



T -e — s 



t:t-X^^. j = l 

T-e — s 



= e"^"(«''''^)p(cr . X = ^, o- ■ e = s|X), 
since P{—u) = 1 — P{u), see equation ()3.8|) . 

3.3 Saddle Point Equations and their Solution 

Given ^, rj, we now face tlie problem of determining an asymptotic value of the local 
probability in ()3.9p . This will obviously be easier if the chosen parameters i and s are 
among the more likely values of cr ■ X and cr ■ e, respectively. A natural choice is to take 
i and s equal to their expected values, that is 

E(a-X|X)=£, E(cr-e|X)=s, (3.12) 

or explicitly (using ()3.8|) . ()3.7|) ) 

n 

^X,tanh(eX, + r/) = -£, 

(3.13) 



n 



tanh(^Xj + 7]) 



Note that the equations (|3.13|) also arise naturally in an apparently different approach 
to estimate the integral in ()3.6|) . the "method of steepest descent." In our context, this 
corresponds to a complex shift of the integration path, i.e., to changing the path of 
integration for x to the complex path from — 7r/2 + iC, to —tt/2 + iC,, and the path of 
integration for y to the complex path from —tt/2 + it] to —tt/2 + ir], where ^ and 7] are 
determined by a suitable saddle point condition. For general ^ and rj, this leads to ()3.9j) . 
while the saddle point conditions turn our to be nothing but ()3.13p . In fact, this is how 
we first obtained (j3.9p and ()3.13p . 

Both approaches raise the question of uniqueness and existence of a solution to the 
saddle point equations ()3.13|) . In this context, it is useful to realize that the conditions 
()3.13|1 can be rewritten as 

aL.(e,r/;X) 9L.(e,r/;X) 
' dr] 
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Therefore any solution rj) is a stationary point of tlie strictly convex function rj; X). 
If a solution exists, it is therefore the unique minimum point of L„. Using the first equation 
in (|3.9p . we see also that t]) maximizes the local probability P(cr ■ X = £, cr ■ e = s|X), 
and hence makes it easier to do an asymptotic analysis. This observation justifies our 
choice of ^,7]. 

In the actual proof, we modify this approach a little since the solution ^ = ^(X.), 
7] = 1](X.) does not lend itself to a rigorous analysis of P(cr ■ X = £, cr ■ e = s|X). Instead, 
we will resort to "suboptimal" C, = C/^i 1i where C,.,ri are nonrandom constants, and 
(C, Tj) is a solution of nonrandom equations, obtained by replacing the (scaled) sums in 
()H.1H|1 with their weak-law limits, see equations ()H.18|1 below. This way we will be able 
to establish an explicit asymptotic formula for Zn{i,s), which will ultimately lead us to 
determine the phase boundaries. 

In Section m we will show that these deterministic equations have a (unique) solution 
( = ((b), 7] = r]{b) iff 6 < 6, = 72 - 1, the same be that determines the sorted phase. 
In other words, the threshold be plays two seemingly unrelated roles: both as a threshold 
value of b for solvability of the deterministic saddle point equations ()3.18p . and as a 
threshold for the sorted partition being optimal. On an informal level, the reason for the 
coincidence is as follows: For simphcity, suppose that the weights Xj are all distinct, so 
that Xi < ■ ■ ■ < Xn after reordering. As b approaches the point where the solutions {(, rj) 
to the saddle point equations ()3.18j) stop existing, these solutions actually diverge, one 
tending to +oo and the other to — oo. According to equations ()3.7|) and ()3.8|) . this in turn 
means that P(crj = 1|X) tends to zero or one, depending on whether j < jo or j > jo, 
where jo = \{j : crj = — 1}| is the cutoff of the sorted partition for X with bias s = nbc- 
Hence, the product measure P((t ■ X = £,(t ■ e = s|X) tends to a delta function on the 
(unique) sorted partition which is the solution to the number partitioning problem for X 
at 6 = be- See Subsection 17.21 for details. 



3.4 Asymptotic behavior of s). 

Proceeding with our heuristic discussion, let us simply assume that the equations ()3.13|) 
do have a solution C, = ^(X),?7 = f](X.). Then we may hope that, with this choice of the 
parameters ^ = ^(X), rj = //(X), we have a reasonable chance to prove — at least for the 
likely values of X — a local limit theorem for the conditional probability in ()3.9|1 . namely 
that w.h.p. 

P(o- • X = £, o- ■ e = s|X) J==, (3.15) 

TTvdet Q 

/ Var(^.X) cov(..X,..e)\ 
\^cov(cr ■ X, cr ■ e) Var(cr ■ e) J ^ ' 

Here the (co)variances are conditioned on X, so, e.g., Qu = Var(cr-X|X). If ()3.15|) holds 
then by 1)3.111) . w.h.p., 

ZJi, s) ~ e"^"(«'^'^) — = e^^-^^'"'^) (3.17) 

vrVdet Q imMVdet 



where 
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where i?*-""^ is the matrix with matrix elements R^'' = ^^^Var(cr ■ X), R^^ = R^ = 
^cov(cr ■ X, cr ■ e) and i?^2^ = ^Var(cr ■ e). 

Note that, in the limit M ^ oo, Xj/M are independent, uniform random variables in 
[0, 1]. We therefore expect that as M,n —>■ oo with k = n~^log2M fixed, both C(X) : = 
M^(X) and ^^(X) are close, in probability, to the deterministic C,,r], defined as the roots 
of the averaged version of the "saddle point equations" (|3.13|1 , namely 



1 

xtanh((^x + rf) dx 



Mn 



1 

g 

tanh.{(x + rj) dx = — b, b = —. 

n 



(3.18) 



Recall that, without loss of generality, we have taken s > 0, so 6 > 0. 

Furthermore, approximating ^(X) and //(X) by MC^ and 77, respectively and using the 
bound I (i cosh I < 1, it is easy to see that, because of the weak law of large numbers, 
w.h.p. 

n 

L„(e(X),r/(X);X) = -^log (e^5(^)+^''(^)2 cosh(e(X)X, + r^(X))) 

1 

~ + bv + j log(2 cosh(Cx + ri)) dx, (3.19) 



and similarly for the matrix elements of R^'^\ 

1 

~ j x2-(^+J)(l - tanh2(Cx + r])) dx. (3.20) 


Putting everything together, we thus may hope to prove that for |^| < 1 and M growing 
exponentially with n, (i.e. logg M nn for some n-independent k), we have w.h.p. 

1 

— log ZnU, s) I log(2 cosh(^x + T])) dx + brj — K 

n J (3.21) 

= Kc{b) - K, 

suggesting that for k < K.c{b) there are exponentially many perfect partitions, while for 
K > Hc{b) there are none. 

However, this informal argument is too naive. Equation ()3.21|) could not possibly 
hold for K > t^cip). Indeed, Z„(£, s) is an integer, and thus cannot be asymptotically 
equivalent to an exponentially small, yet positive number. This means that a rigorous 
proof of 1)3.211) must be based on the condition k < Kc{b). But our heuristic discussion 
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provides no clue as to how this condition might enter the picture. Furthermore, our 
attempts to find such a proof are stymied by mutual dependence of the random variables 
P((Tj = 1|X), (1 < j < n), a consequence of the fact that (^(X),r7(X)) depends, in an 
unwieldy manner, on the whole X. This complicated dependence of (^(X),r7(X)) on X 
would have made it very hard to gain an insight into the random fluctuations of the sum 
in ()3.19|) . even if we had found a proof. 

Fortunately, once we have informally connected (^(X),r7(X)) to {C,1]) via ^(X) = 
(1 + Op{l))(/M, rj(X.) = (1 + Op{l))r], we may try to use the suboptimal parameters 
{(/M,ri) instead. The corresponding random variables ^{(Jj = 1|X) each depend on their 
own Xj, and are thus mutually independent. A key technical issue is whether the sub- 
optimal parameters are good enough to get an asymptotic formula for the corresponding 
probability P(cr ■ X, cr ■ X = s|X), given that now the random equations ()3.13|) may hold 
only approximately. Our proof below shows that they are indeed sufficient. With those 
parameters, we will be able to get a sharp explicit approximation for logZ„(£, s), at least 
in the range k, < H-{h). 

We prove the existence and uniqueness of the solution to ()3.18|) in the next section, 
and then use them in Section EH] to derive the asymptotics of s). 

4 Solution of the Deterministic Saddle Point Equa- 
tions 

Based on the heuristic discussion of the last section, we define a as the random sequence 
((Ti, . . . , an) such that, conditioned on X, the a-,- are independent and 

P(a, = l|X) = p(^C§ + r/^, P(a, = -l|X) = l-p(^C^ + r^^, (4.1) 

with P{u) defined in ()3.8|1 . and {Cirj) is a solution of the equations 

1 

j X tanh((^a; + rj) dx = 0, 

(4.2) 

j tanh((^x + 7])dx = —b. 



These are the equations (|3.18j) . except that the right hand side of the first equation is set 
0, since our focus is on £ <^ Mn. 

However, does such a solution exist? A key observation here is that the equations 
fj4.2|) mean that {(^v) is a stationary point of the function 



1 

L(C,,) := 6, + /log(2coBh(C. + 





(4.3) 
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which is the r.h.s. in ()3.19p . without the term (i/{Mn), i.e., the (weak) limit of the 
function L„(^(X), r7(X); X) defined in ()3.10|) . Since L{(,ri) is strictly convex, it may have 
at most one stationary point, and this point is a global minimum. So a solution to (j4.2j) 
exists iff L{(,r]) attains its global infimum. 

Theorem 4.1 Let < b < K := - I. Then: 

(1) L{(,r]) attains its infimum, hence there exists a unique solution {C,r]) = {({b),r]{b)) 
of the equations ()4.2j) . 

(2) The minimizers ({b),ri{b) are continuous functions, with ({b) > 0, ri{b) < and 
C(6) + 7/(6) > whenever < b < be- 

(3) L{b) := L{({b),ri{b)) , the minimum of L{(,ri), decreases with b. For b G (0,6c); 'its 
derivative is dL{b)/db = r]{b). 

(4) limb^bAib) = oo, \imb-^b,v{b) = -oo, 



-Tl{b) _ 1 



(4.4) 



and \\mb-,b^ L{b) =0. 

(5) limfe_^oC(^) = 0, Ywib^QViP) = 0? ^'^'^ limfe^o-^(^) = log 2. 




Figure 3: Numerical solution of eq. 14.21 The dashed lines are the linearized solutions 
((b) = 66 + 0(6^) and r]{b) = -46 + 0(6^). 

Proof of Theorem 14.11 

(1) Since log(2coshM) > \u\, we have 



L{C,v)>C{Cv), 



(4.5) 



where 



1 



(4.6) 
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It thus suffices to prove that 



hminf C{C,v) = oo, (4.7) 

ll(C>»?)l|oo^OO 



where ||(C,^?)||oo = max{|C|, |r/|}. 

Since C is homogeneous, degree 1, and continuous, it suffices to prove that £{(,!]) > 
for all iCv) such that ||(C)'7)||oo = 1- To this end, we will ffist show that 

min C{(,r])= min £(l,r/). (4.8) 

||(C,»?)lloo=i -i<-'?<o 

Since C{(,r]) > C{\(\, —\r]\), it suffices to consider r] < and ( >0. But rj < 0, ( > and 
11(C) ''?) II oo = 1 implies that either C = 1 ^ind — 1 < rj < or rj = —1 and < C < 1. We 
thus have to show C{(, t]) is bounded below by the right hand side of ()4.8|) if 77 = —1 and 
< C < 1- Indeed, under these conditions, ^3^ + ^71 = 1 — C^^ ^ 1 — x = |x + ?7|, implying 
that £(C, rj) > C{1, —1), which is clearly bounded below by the right hand side of ()4.8p . 
It remains to bound C{l,r]) from below, for — 1 < 77 < 0. Setting x = —r], we have 

1 

£(1, r]) = -bx + J \x-x\dx=^ + x'^ -x{l + h). (4.9) 


At X = Xo := (1 + h)/2 the right hand side attains its minimum value |[1 — (1 + 6)^/2], 
which is bounded away form zero as 6 < 6c = — 1. Thus we have proved that for 
6 G [0, v^- 1), and all C, 

£(C, r/)>c* (6) max{|C|,|r7|}, (4.10) 

where c*{h) = |[1 — (1 + b)^/2] > 0. Therefore attains its infimum, hence so does 

L{(,ri). We conclude that ()4.2|) has a unique solution solution {(^v) = {C{b),ri{b)). 

(2) Suppose ri{b) > 0. Using ()4.2|) . and ytanhy > if ?/ 7^ 0, one can show easily that 
ri{b) > and ({b) < 0. But such a point {({b),r]{b)) cannot be a minimum point of L(^, r^), 
since L(^(6), 77(6)) > L{—({b),—r]{b)), due to the symmetry of coshy, and the fact that 
bri{b) > —br]{b). Therefore t] = rj{b) < 0, and 

C + ^= (Ca: + r?)L=i>0. (4.11) 

Continuity will be proved along with (3), where we actually prove continuous differentia- 
bility. 

(3) The equations ()4.2|) are an explicit form of = 0, = 0. It is easy to show that 

T 

differentiable, and consequently 



the Jacobian matrix \ j- ^ ) is nonsingular. Therefore C(^)) vi^) ^i-re continuously 



Uib) = Uib, V, + Hb, C, v)Ub) + L,{b, C, v)Vb{b) ^ ^ ^ ^ (4.12) 
=U{b,v{b)X{b)) = v{b)<0. (4.13) 
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Therefore L{b) decreases with b. 

(4) Pick ( > 0, xq E (0, 1), and set r] = — xqC- Breaking the integral ()4.3p into two parts, 
X G [0, Xq] and x G [xq, 1], and using 

log(e" + e-") =u + log(l + e-2«), (4.14) 

we get: 

L(C,r/) S{2xl-2xo{l + b) + l) 

2CXQ 2C{l-xo) (4.15) 

+ ^ y log(l + e-") f/M + ^ j log(l + e-") dM. 



At Xo = (1 + b)/2 the quadratic polynomial attains its minimum value 1 — (1 + 6)^/2, 
which is positive since b < b^.. With this Xq, choose C = (l — Then ^ ^ oo as 

b be, so that the integral terms add up to 0(C~^), which is also the order of the first 
term. Hence L{(,ri) 0, and therefore limh^b^ L{b) < 0. To see that \imh^b^L{b) = 0, 
we just note that L{b) = min^^^ L{(, rj) > min^ ,j C{(, rj) = 0. 

To complete the proof, in ()4.15p set ( = ({b) and xq = —rj{b)/Q{b), so that rj = rj{b). 
(Note that xq G (0, 1) as C,{b) + rj{b) > 0.) Since the quadratic polynomial in ()4.15|1 is 
non-negative if 6 < be, and max(xo, 1 — xq) > 1/2, we have that 

m 

lim L(b) > liminf -i- / log(l + e~") du > 0, (4.16) 

b->bc b^bc 2({b) J 



if liminffc^b^ ({b) < oo. So, since lim^^b^ L{b) = 0, we conclude that ({b) ^ oo as 6 ^ 6c- 
But then, using ()4.15|) again, 

= lim ^ [2x1 - 2a;o(l + b) + l]^ lim [2x1 ~ 2xo(l + 6) + l] = (4.17) 

^►limxo = ^, (4.18) 
^2' 

the only root of 2x^ - 2x^2 + 1=0. 

(5) This statement is immediate from the fact that C(^)) continuously differen- 
tiable on (0,6c) and the observation that for 6 = 0, L[(,ri) attains its minimum value of 
log2at(C,r/) = (0,0). ■ 



Remark 4.1 Theorem \4.1\ can easily be generalized to the case of non-uniform i.i.d. ran- 
dom variables Xj provided Xj/M has a limiting density fi{x), x > 0. In that case, be is 
replaced by be = be{fi), as defined in B.emark [S~l\ i.e. 

pxo rxo poo 

be = 2 fi{x) dx — 1, / xfi{x) dx = / xfi{x)dx. (4.19) 

Jo Jo Jxn 
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The counterpart of L{(,r]) is obviously 

poo 

L{C,T]) = bT]+ log(2cosh(Cx + ?7))/i(x)(ix. (4.20) 
Jo 

Remarkably, this extension requires only obvious changes in the above proof. For instance, 
below is the proof of the item (1). Note that the surprising dual role of he, namely as both 
the point at which the saddle point equation stops having a solution and as the threshold 
for optimality of sorted partitions, still holds for hc{^). In fact, if correctly interpreted, this 
dual role persists deterministically for any fixed instance {Xi, . . . , see Subsection ]?.^ 

Proof of (1) for an arbitrary distribution fi: First, we write 

POO 

L{C,r])>C{C,r]) = hr]+ \Cx + 1]\ ^{x) dx. (4.21) 

Jo 

Second, we bound 



min CiCv) > min I min C(l,ri), min £(C, — 1) . (4.22) 

ll(C,r?)||cx>=l ~ V-l<»7<0 0<C<1 ^ ' 



Furthermore, 



£(C,-l) = -& + Af(C-^) 



f^y POO 

M{y) := I {1 ~ xy^^)fi{x) dx + / {xy^^ — l)fi{x) dx. 



Using HUni we see that 



Jy 



min M{y) = M(xo) = 6c(/i), (4.23) 



hence 

C{C,-l) > -fe + &c(/i) >0, (4.24) 
for b < bc{fi). Likewise, if r/ < 0, we set rj = —x, and easily obtain 

C{1, r]) = x{-h + M(£)) > 5(-6 + 6c(/i)) > 0, (4.25) 

for h < bcifi). ■ 



5 The Perfect Phase 

5.1 Asymptotic Enumeration of Partitions by Discrepancy and 
Bias 

Now that we have proved existence of the solution (Cv) of ()4.2|) for b < \/2 — 1, we are 
justified in using the marginals ()4.1|1 . It is critically important that each P(o"j = ±1|X) 
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depends only on its own Xj, so that they are mutually independent. This would not have 
been the case if we had used (^(X), //(X)), the solution of the random equations ()3.13p . 
The corresponding version of (j3.9p is 



+ + (2c°sh (Cir^ + r^)) ]/„(X). (5.1) 

j=i / 

Here /^(X) is the random integral 

/„(X) = ^ J J e-(^-+^J')/(x, y; X) dxdy, (5.2) 

x,ye{~n/2,TT/2] 



where 

fix, y; X) :=E(exp(2(x<T -X + ya- e))|X) 



(5.3) 



= ]^[p(X,/M)e'("^^+^) + g(X,/M)e-'("^-'+^)] , 
j=i 

with the shorthands p{u) and q{u) for P(Cm + ??) and 1 — P{Cu + 77), where {C,r]) is the 
solution of ()4.2|1 . and P(-) is given by ()3.8|) . Now that the summands in ()5.H1 are simply 
i.i.d. random variables, the core of the problem is to determine the asymptotically likely 
behavior of /„(X). 

Theorem 5.1 Suppose that limsup|s|/n < be, < liminf limsup(K — K-(&)) < 0, 
i = o{Mn^^'^) and n = o(M^). Then there is a constant S > such that, with probability 
l-0(e-'''°s^"), 

Here 

^ (2E{UMU)qm 2E{Up{U)q{U))\ 

\2E{Up{U)q{U)) 2E{p{U)q{U)) J ' ^ ' 

and Tn is a two-dimensional random vector which converges in probability to a Gaussian 
vector T, with mean zero and covariance matrix 

( Var{U{p{U) - q{U)) cov{U{p{U) - q{U)), p{U) - qiU))\ 

\cov{U{p{U) - q{U)), p{U) - q{U)) Var{p{U) - q{U)) ) ' ^ ^ 

where U is uniformly distributed on [0, 1]. Furthermore, ||t„|| < logra with probability at 
/east l-0(e-'^'°f'"). 

Consequently, with probability 1 — 0(6^*^'°^ "), Zn{i,s) > 1 and 

logZ„(£, s) = n[L{C, V) - 'tlog2] + n'^'^Sn + o{n^/^), (5.7) 

where 

^« = ;37? E (2 cosh (C^ + r/)) - E [log (2 cosh {(j^+v))]] (5.8) 
j=l V / 

is asymptotically Gaussian with zero mean and variance = V'ar(log(2 cosh(C?7 + ri))). 
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Remark 5.1 It is a consequence of Theorem \4.1\ and Theorem \5.1\ that for all b G [0, be), 

K-{b)<Ke{b), (5.9) 

with 

K_(0) = Kc(0) = 1 and K^{bc) = n^ibc) = 0. (5.10) 

Indeed, let us assume that liminf(fi;_(6) — k) > 0. By Theorem \5.1l we then have that 
log Zn{i, s)—[L((, 7])— K log 2\ — > in probability. In particular, since w.h.p. Zn{i,s) > 
1, we have liminf(L(C, 77) — «:log2) = liminf (/tc(&) — K)log2 > 0. So the condition 
liminf (k_(6) — k) > implies that liminf (^^(fe) — k) > 0, which proves ()5.9|) . Since 
((0) = T]{0) = 0, it follows from the definitions jOl) - (ESI) that k„(0) = K^iQ) = 1. 
Finally, by Theorem \4.1[ we have L{bc—) = 0, or ndbc—) = 1- Numerical computations 
indicate that K-(6) < Kc{b) for < b < b^ but the graphs of two functions remain sur- 
prisingly close to each other. Our limited attempts to prove this strict inequality have not 
succeeded. 

Proof of Theorem 15. H Pick a large, but fixed, B > 0. Split the integration into two 
parts, |x| < B/M and |x| > B/M, and denote the corresponding integral /„i(X) and 
/„,2(X), respectively. Consider /,ii(X) first. Begin with 

p(X,yM)e*("^^+^) +g(X,yM)e-'("^^+^) ' 

= 1 - 2pjqj{l - cos(2(a;Xj + y))) (5.11) 
< exp(^-2pjqj{l - cos(2(xXj + y)))j, 

where we have introduced the abbreviations pj = p{Xj/M) and qj = q{Xj/M). Let 

Sn = {j : X,/M < 7r/(35)}. (5.12) 

Clearly, j G Sn implies 

2|xX, +y\<2 (^-^X, + < 1^ < 2vr. (5.13) 

Now, there exist a constant ci > such that 

1 — cosa>Cia^, a G [— 57r/3, 57r/3]. (5-14) 

Since 

_ 1 1 

~ 2cosh'^{CXj/M + r]) ~ max{2 cosh2(r/), 2 cosh2(C + r?)} ' 
we thus have shown that there exist a C2 > such that 



\f{x,y;X)\ <exp (-c^ J2 (^^. 

\ jeSn / 



(5.16) 



:exp(-C2(a;^M2 + 2xyMi + y'^Mo)), 
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Mk = Mu{B) = J2 = E^f (^)' (^) ■= ^iI{x,/M<V(3B)}- (5-17) 

jeSn j = l 

Since xj^^(S) are independent with (B) /E[x!.''\b)] = 0{B), the event 

An = {Mke [0.99nE{X^''\B)), lMnE{X^''\B))], A; = 0,1, 2} (5.18) 

has probability 1 — 0(e~''^") for some 6i > 0, by, e.g., the Azuma-Hoeffding inequality. 
We continue our computation on the event An. It is easy to see that 

x^Ma + 2xyMi + y'^Mo > C3n[{Mxy + y^], cg > 0. (5.19) 

Set r„ = n~^/^ logn. From (|5.16|) and (|5.19|) . it follows that, for C4 = C2C3, 

/ \ 

JJ \f{x,y;X)\dxdy<^ jj 6"^^^' rfzirfz2 = O ((Mn)-^^^'"^''^) . 



\\(Mx,y)\\>rn 
\x\<B/M 



\||z||>logn / 



For \\{Mx,y)\\ < Tn, we expand and exponentiate: 



(5.20) 



1 ' " ^2 , ^n^v |3 , L.|3^ 



= 1 +^{xXJ + y) {p, - q,) - - {xXj + y) + 0(|xX,f + \y\ 
= exp (^i [xXj + y) {pj - Qj) - ^ {xXj + y) ^ 

+ ^{xX, + y)\p, - g,)2 + 0(|xX,f + \yf)\^ 
= exp (^i{xXj + y) {pj - qj) 

- 2pjqj{x^X] + 2Xjxy + y^) + 0(|xX,f + \y\')^ . 

Therefore, 

X exp(-(Mx, y)Q{Mx, y)' + 0{nM^\xf + n\yf)), 
where {Mx,y)' denotes the transpose of {Mx,y), and 

T„i=^(X,/M)(p,-g,), 

j 

Tn2=^{Pj-qj), 



(5.21) 



(5.22) 



(5.23) 
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Qn=5^(X,/M)>,g,, 

j 

j 
j 

Here T„i,T„2 are sums of i.i.d. random variables, and it is easy to sliow tliat tlie 
random vector 

Tn = (r„i, r„2) := n-'/\Tr,i - E(T„i), T„2 - E(T„2)) (5.25) 
is asymptotically Gaussian, with zero means, and the covariance matrix {KM{i, j)}, with 

Km{1, 1) =Vs.t{Um{p{Um) - q{UM))), 

Km{1,2) =coy{Um{p{Um) - qiUM)),p{UM) - g(f/M)), (5.26) 
Km(2,2) =Var(p(f/M) - g(t/Af)), 
and Um distributed uniformly on {1/M, . . . , M/M}. Consequently 

Km = K + 0{M-'), (5.27) 

where K is defined like Km, with Um replaced by the [0, l]-uniform U, which is the matrix 
K defined in fl5.6|) . We can use K as the limiting covariance matrix for t„. Also, again 
by Azuma-Hoeffding, for any c > 0, 

||t„|| < clogn (5.28) 
with probability 1 — 0{e~^'^^°^^ for some 62 = ^2(0) > 0. In addition, 

E(T„i) =nE[{X/M){p{X/M) - q{X/M))] = nE{U{p{U) - q{U))) + 0{n/M) 

=^^"/'^)' (5 29) 

E(T„2) =nE[p{X/M) - q{X/M)\ = nE{p{U) - q{U)) + 0{n/M) ^ ' ' 

=s + 0{n/M). 
Using the equations ()5.25|) and ()5.29p . we obtain 

T„i-£/M = n^/2(r„i + 0(nVVM+|^|/(MnV2))), 
Tn2-s = n^/\T^2 + 0{n^'^lM)). 

Both remainder terms are o(l) since n^/^ = o(M) and (. = o(Mn^/^). Since Qij is a 
sum of n bounded i.i.d. random variables, Qij = E{Qij) + 0(n^/^logn) with probability 
1 - 0(e-'^3'°f'") for some ^3 > 0. Approximating E{Qij) as E{Qij) = n[Rij + 0{M-^)] = 
n[Rij + o^n^^^"^)], where 

Rn =2E{U^p{U)q{U)), 

Ri2 =2E{Up{U)q{U)), (5.30) 
R22 =2E{p{U)q{U)), 
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we get 

Q = n{R + 0{n~^^^ \ogn)) 

with probability 1 - 0(6"''^ 

Consequently, with probability 1 — 0{e~^^^°^ 

\\{Mx,y)\\<r„ 

exp(z(Mn^/2^)(r„i + o(l)) + i{n'/%){r^2 + 



(5.31) 



\\(Mx,y)\\<rn 

exp{-{Mx, y)Q{Mx, y)' + 0{\Mx\^n + \y\^n)) dxdy 



Mn 



Vdet R 



exp(-ir„i?-V„)+o(l) 



(5.32) 



Using ()5.2()|1 and ()5.28|1 . we see that the r.h.s. of ()5.32|1 also gives the asymptotics of 
/ni(X), the integral over all {x,y) with |x| < B/M. 

Let us turn now to /„2(X). We want to show that for some 5 > we have /„2 = o(/„i) 
with probability 1 - 0(e-'^i°s'"). First of all, by ((^3), O and the definition of /„2(X), 



|/n2(X) 




-ie{xi-X2)-is{yi-y2) 



F(x,y;X)c?xdy, 



(5.33) 



i/l,H2S(-'r/2,vr/2] 



where 



^(x, y; X) =/(xi, yi, X) f{x2, y2] X) 



n[ 



2 i{x'^X,+y{) , 2 -i(x'^X,+y[) 



+ qje- 



and 

Therefore 



= Xi - X2, X2 = Xi + X2; 2/1=2/1- 2/2, 2/2 = 2/1 + 2/2- 
1 



E(|/„2(X)|^)<- 




|F(x,y)rc?xrfy, 



(5.34) 
(5.35) 

(5.36) 

(5.37) 
(5.38) 



!/l,y2e{-T/2,ir/21 



where 



j j 
E^(^VM)g(jyM)(e^(-^^-+^^) + e-^-^^'^^^)). 



(5.39) 



To proceed, we need the following lemma. 
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Lemma 5.1 Let g{u) he continuously differentiable on [0, 1]. Then, for x ^ 27rZ and for 
all y, 

i\/r 

ngw + iww 



< 



M 

where \\g\\ := max{|5f(-u)| : -u G [0, 1]}, and \\g'\\ := max{|5f'('u)| : m G [0, 1]}. 
Proof of Lemma 15. IL First write 



(5.40) 



M 



i=i i=i 

1 



gU/M) 



A/-1 



+ - ^?(j7M))e^(-(^-+^)+^) 



. (5.41] 



Since each of the differences in the last sum has absolute value bounded by M~^||(yf'||, 
while the first two terms are clearly bounded by \\g\\, we obtain (j5.4(J|) . ■ 

Returning to the proof of the theorem, we observe that since \xt\ < 7r/2, we have 
Wt\ < 7T, t = 1,2. Furthermore, 



B/M <\xt\ = 0.5\x\±x'2\, t = l,2, 



(5.42) 



implies that max{|x'^|, \x2\} > B/M. If > B/M, then applying Lemma l5.ll to the 
first two sums in ()5.39|) . we obtain 



i^(x,y)| <-5^p(j7M)g(j7M) + 0(5-^) 



--2 [ p{u)q{u)du + 0{B-^). 
Jo 



(5.43) 



If Ix'al > B/M, then hkewise 

|F(x,y)| < [\p\u) + q\u))du + 0{B-') 
Jo 

And, if min{|x;|, Ix^} > B/M, then 

|F(x,y)| = 0(5-1). 



(5.44) 



(5.45) 



Since the right hand side of ()5.43j) is dominated by the right hand side of ()5.44|1 (just use 
that 2pq < + g^), we conclude that there is a constant c independent of B such that 



|F(x, y)| <p + c5-i or |F(x, y)| < c5-\ 



(5.46) 
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depending upon whether only one or both exceeds B/M. Here 



1 



p = {p (u) + q (u)) du 







j ^1 — - cosh {C,u + 7])^ du 



tanh(C + r]) - tanh(r7) (5.47) 

_ 2-K-(6)_ 

We conclude then that 

E(|42(X)p) < {c/Br + 0{M-\p + c/Br). (5.48) 

Since Mp^ is exponentially small if limsup(/t — /t_(6)) < 0, we get that for B large enough, 
there exists a ^4 > such that 

E(|/„2(X)|2) <e-^^"(Mn)-2. (5.49) 



On the other hand, |/ni(X)|2 = (Mn)-2e^('°s"') with probability 1 - (^(e-'^^iog^n^) _ 
0(e-^^'°s'") by (IK:^ and (O^ . Thus |/„2(X)| = o(|/„i(X)|) with probability at least 
1 - 0(e-^5iog n^)^ implying To prove ()5.7|) . we just note that the prefactor in ()5.1|) 

can be rewritten as 

Ctt + sr/ + ^ log (2 cosh {Cj^ + ^)) ) 

= exp (^-^ + sri + n^/'^Sn + ^ cosh(Cx + ?7))c/x + 0{nM'^)\ (^-^O) 

= exp (^nL(C, r]) + n^/^S^ + ©(nM^^) + C^/M 

while, with probability 1 — 0(e"'^^°^^'*), 

4(X) = M-ie°(^°§'") = M-ie°("'^'). (5.51) 



Remark 5.2 ^45 i/ie reader may have noticed, the condition that I has the same parity as 
Xj has not been used in the above proof. Thus the asymptotics stated in ()5.4|1 hold for 
all i with i = o{Mn^/'^), independent of the parity of i. But this does not mean that the 
corresponding asymptotics hold for the number of partitions Zn{i,s), since ()5.ip is valid 
only if the parity of i is the same as that of Ylj Xj . If this condition is violated, the left 
hand side of ()5.ip is zero, while w.h.p., the random integral /n(X) is different from zero. 

Applied to the special case < 1, Theorem 15. II asserts, roughly, that for every point 
{b, K,) such that b < be and k < K-(&) < 1, w.h.p. there are exponentially many perfect 
partitions. In fact, if in the right hand side of ()5.1|1 the random integral /„(X) is replaced 



Constrained Integer Partitions (DRAFT, February 24, 2003) 



29 



by the leading term in ()5.4|) . then the resulting product remains exponentially large within 
the narrow (crescent-shaped) region between k = /«_(6) and k, = K,c{b)- In principle, this 
may mean that the likely number of perfect partitions remains exponentially large in this 
extended region! An extensive numerical simulation (see Section ^ strongly suggests 
that the expected logarithm of the number of perfect partitions at every point of the 
region k < K,c{b) is extremely well approximated by the expected logarithm of the above- 
mentioned product. Based on our experience with the unconstrained problem and these 
simulations, we expect that w.h.p. at least the weaker formula 

log Znii, s) = n[L(C, r/) - log 2 + o(l)]n, (5.52) 

cf. fl5.7|) . remains valid in the whole region limsup(/t — ndb)) < 0. 

5.2 Distribution of the Bias 

Let us have a closer look at the case \i\ < 1 and s = o{n). A simple computation shows 
that 

( = Q- + 0((s/n)2), 7] = -4- + 0{{s/ny), (5.53) 
n n 

so that 

p{u) = PiCu + r]) = l + Oi\s\/n), (5.54) 

and the entries of the covariance matrix K are of order 0{\s\/n). Furthermore 

[1/6 + 0{\s\/n) l/4 + 0(|.|/n)\ 
\l/4 + 0{\s\/n) 1/2 + 0{\s\/n)J ' 

so that deti? = 1/(48) + 0{\s\/n), and Q yields 

/„,(X) = (l + o,(l))i^. (5.56) 
In addition, the exponential factor in the formula for Zn{i, s) becomes 2"'e'^", where 

Sn = -A- + 0{\s\/inM) + \s\yn'') 



n 
1 " 

+ ^Y.(^X,/M + vf + 0{sVn'^ 



(5.57) 



By the central limit theorem and ()5.53|) . the sum can be written as 

C^(nE(f/2) ^ Op{n^^^)) + 2C7]{nE{U) + Op{n^/^)) + rfn 

=4 + 0(|.|Vn^) + 0,(.V^'/') (5.5^ 

=4- + 0p(l), 
n 
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uniformly for |s| < r^l'^u) ^(n), where uj{n) oo, however slowly. Thus ()5.57|) simplifies 
to ^ 

Sn = -2- + 0p{l). (5.59) 
n 

Using this formula and ()5.56p we obtain 

Z^{e, s) = {l + 0,(1))^-— (5.60) 

uniformly for \s\ < n'^^^LU~^(n) and \i\ < 1; and, of course, s = n(mod 2), and i = 
T^jXj (mod 2). In [3 it was shown that, for Mn^l^/T' 0, F„(£) = E^-oo the 
total number of partitions'^ with cr ■ X = £ is asymptotic, in probability, to Let 
Sn denote the bias of a perfect partition cr*^") chosen uniformly at random from all perfect 
partitions. The formula for Yn{t) and ()5.60|) prove the following corollary of Theorem 15.11 

Corollary 5.1 Assume t/iatlim sup Mog2 M < 1. Then 

= s\X) = (1 + Op(l))^e-2«'/" (5.61) 



uniformly for \s\ < n^/'^uj ^{n), (s = n(mod 2) ), and consequently 

x] ^pJ- [ e-'^^'^du. (5.62) 




< a 



Remark 5.3 Thus the bias of the randomly selected (typical) perfect partition is exactly 
of ordern^/'^ , just like tn, the bias of the sequence ofn flips of a fair coin, i.e. the difference 
between number of heads and number of tails of a fair coin. However, 



i.e. in distribution the bias of the random perfect partition is, in the limit, half as large as 
the bias of the sequence ofn coin flips. Perhaps we should have anticipated some reduction 
of the typical bias, since perfect partitions are by definition those with the smallest discrep- 
ancy, which should favor smaller bias. In retrospect, the fact that most perfect partitions 
turn out to have small bias may be responsible for the greater mathematical tractability of 
the unconstrained problem. 



''Note that Yn{i) is equal to the total number of perfect partitions if ^ = 0, and equal to half the total 
number of perfect partitions if ^ = 1. In a similar way, Zn{i, s) is the total number of perfect partitions 
with bias s if ^ = 0, and half that number it £ — 1. 
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6 The Hard Phase 

6.1 Lower Bounds on the Minimal Discrepancy 

Cautiously extrapolating the asymptotic formula in Theorem 15.11 we expect that w.h.p. 
there will be no perfect partitions in the domain where this expression tends to zero, that 
is where k > Kc{b). Here we establish this result outside a small window of width n~^/^ 
above Kc- 

Theorem 6.1 Suppose that limsup|s|/n < b and that < liminfK < limsup/t < oo. 

Let Sn be the random variable defined in (|5.8|) . let in be a sequence of positive integers 
with In = oiMn^/"^) and let \i\ < £„. Then, with probability at least 1 - 0(6" 

and 

5^ <42['^^('')-''l"e"''"^"ei°^'"+^(^"/*'). (6.2) 

e:\e\<i„ 

If [k — Kc{b))n^/'^ oo, we thus have 

SO that w.h.p. there are no perfect partitions. 

Remark 6.1 (i) The proof of (|6.3|) can be generalized to show that there is a constant 
5 > such that, with probability 1 - 0{e~^^"^^"), 

dopt> [2^^-^^W-^-'^'^°Sn]n^_ ^g_4^ 

For K — Kc{b) > 1 + n^^/^logn, there are therefore no perfect partitions with probability 
l-0(e-'5'°s'"). 

(a) If \k — Kc{b)\ = 0{n^^^'^), the term n^^'^Sn is larger than n\K — Kc{b)\ with positive 
probability, implying that with positive probability, Zn{i,s) is smaller than 1, and hence 
zero. The above theorem therefore implies that for \k — Kc{b)\ = 0(n~^/^), the probability 
that there are no perfect partitions stays bounded away from zero. 

(Hi) As mentioned in Remark we believe that dopt = 2^'^^'^"'^°^^^^^'^ whenever 
liminf(/€ — Hc) > 0. In other words, we believe that for every e > 0, w.h.p., dopt < 
2[K-Kc+e]n^ // we assume such a bound, then w.h.p. the number of optimal partition 
Zopt is bounded by the right hand side of ()6.2|) with = |^2[''~'''="'"^]"J , implying that 
w.h.p. Zopt < 2^*^". Since e was arbitrary, we get Zopt = e"^^"^^ whenever \immi(K—Kc) > 0. 

Proof of Theorem 16.11 and Remark 16.11 (i). The bounds ()fi.3j) and ()fi.4j) follow 
from ()6.2|) . Indeed, let tOn be such that tOn ^ oo and [k — Kc{n)]n^/'^ — a;„ ^ oo as 
n —> oo. Setting = \2'''^-'^c{b)]n-n^/^uin ^j^g bound 1)6. 2|1 immediately implies dopt > in, 
which gives (Q. To prove (Q, we set C = [21'^-'^-^-""'^" i°s"]'^J and observe that 
for 5 sufficiently small, Sn is bounded by |logn with probability 1 — 0{e~^^°^ "). As a 
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consequence, the r.h.s. of ()6.2j) goes to zero with probabihty 1 — 0(e~'^'°s^"), implying 
again that dopt > 

It is thus enough to prove and (|6.2p . Note that the bound (|6.2|) is not just a 

consequence of the bound ()6.1|) since the intersection of 2£„ — 1 events happening with 
high probabihty does not necessarily happen with high probability if in is not bounded. 

Using ()5.5U|) . we rewrite s) as 

Zn{i, s) = 2"«^We"''"^"+';^/*-^+«("*^"')Z„(£, s), (6.5) 
where S'„ is defined in ()5.8j) and 

Zn{i,s) = ^ J e-'^'^+^y^f{x,y-lC)dxdy, (6.6) 

( — 7r,7r] 
!/e{-7r/2,7r/21 

with /(x, ?/;X) as defined in ()5.3j) . In contrast to the relation ()6.5j) holds whether 

£ has the same parity as 'Y^- Xj or not, since x is now integrated over [— tt, tt) instead of 
[— 7r/2, 7r/2). Introducing finally 

Z„(s)= ^„(£,s), (6.7) 

i:W<in 

we have 



t\e\<in 

Since nM~^ = o(log^n), the bounds ()6.H) and ()6.2|) are equivalent to proving that, with 
probability 1 - 0(e-^°g'"), we have Z„(£, s) < M-^e^"^^"" and Z„(s) < CM-^e^°g'". 

We will prove these bounds by establishing a suitable bound on the expectation of 
Zn{£,s). To this end, we first rewrite E(Z„(£, s)) as 

EiZnii, s)) =^ II e-^'^-'^yrix, y) dxdy, (6.9) 

ye(—^/2.7v/2] 

^) =Jf E^'(^VM)e^(^^+^) + E ?(j7M)e-*(^^+^). (6.10) 
i j 

As in the estimates of E(|/rt2(X)p) in the proofs of Theorem 15.11 we need the bounds of 
\f{x, y) \ for various ranges of x, y. 

Pick B >0. Let |a;| > B/M. Using Lemma I01 and (jOTl|l . we get 

|/(x,y)|<c/S. (6.11) 
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Let |x| < B/M. Setting x = z/M, we have \z\ < B. For these x's, let us bound y)| 
more sharply. We have 



1 ^ 

-^p(j7M)e 



i{zj/M)+iy 



1 2 
^ J^p^O'/M) + ^ E P(ji/M)p(j2/M) cos(^(ji/M - jVM)) 



f I V 2 

V i=i / ii<j2 



/M)p(jVM)(1 - cos(;2(ji/M - J2/M))). (6.12) 



Here 



1 

-^p(j7M)= / p(n)rf« + 0(M-i; 

7 = 1 



Pick a small e and consider ji < j2 such that 



(6.13) 



(6.14) 



Clearly there are 0(/?M^) such pairs (ji, 72)- For those (^1,^2), and < B, there is a 
positive constant c = c(e) , such that 

l-cos{z{jjM-j2/M))>c[z{j,/M-j2/M)Y (6.15) 
So the double sum in ()6.12|1 is bounded below by 



^ 5^ p{n/M)p{j,/M){j,/M - j,/Mf = {c'B-' + 0{M-^))-. 

\h/M-j2/M\<f3 

C 



B 



p{ui)p{u2){ui - U2) duidu2. 



(6.16) 



ui,U26[0,ll 
"l-"2l</' 



We thus obtain 
1 *^ 

-J]p(,/M)e^ 

i=i 

so that 

A/ 



(2:j/M+y) 



< / p{u)du + 0{l/M)] - {c'B-^ + 0{M~'))z\ (6.17) 



-^J2p{j/M)e'^'^^^'+y^ < (^J^ p{u)du + 0{M-^)^e"''''~''\ (6.18) 



for some positive constant ai. Analogously to ()6.18p . 



1 

-^g(j7M)e- 



■i(zj/Af+y) 



< / g(M)c?M + 



0(1/M)^ 



{a2 > 0), (6.19) 
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hence, for a = min{ai, 02}, 

\fix, y)\<{l + 0(M-i))e-°^"'"' < e""'^"'^', a' > 0, (6.20) 

for B > 1^1 > Q{^B/M). The bounds (IFTTTl) and fTF^ indicate that is small 

for large and moderate values of z(= xM), regardless of y. The value of y begins to 
matter when z is small. To see how, we need yet a sharper bound for \ f{x,y)\ for small 
1^1. By the definition (jfj.lOj) of f{x,y), we have 

\fix,y)\' =^ ^p(jVM)p(jVM)e-(^VM-.VM) 



M2 

n,j2 



M2 



*(2(ii+i2)+2j/) 

M2 

Jl J2 



(6.21) 



so that 

+ 2(]^E^(^VM)j (^g(j7M)) (6.22) 

- $^p(ji/M)g(j2/M) [1 - cos(^(ji/M + J2/M) + 2y)] . 
ji >i2 

Here the first three terms add up to 

\ 2 

^J2iPij/M) + qij/M))j =1. (6.23) 

Furthermore, picking > small enough so that 

2zo + 2\y\ <2{zo + 7i/2) <27T - e, (6.24) 

we get: for \z\ < zq, 

1 - cos(^(ji/M + J2/M) + 2y) > c[z{3i/M + 32/M) + 2y]\ (6.25) 
The first inequality in ()6.24|1 holds because \y\ < 7i/2, a consequence of s = n(mod 2). 
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So, within the factor 2c, the double sum in ()6.22p exceeds 

5^ p(ji/M)g(j2/M) [z2(ji/M + jVM)2 + %z(jVM + J2/M) + %2] 



p{ui)q{u2){ui + U2)'^ duidu2 + 0{M ^] 



yi,"2e[o,i] 
/ 



+ 4yz 



\ 



p{ui)q{u2){ui + U2) duidu2 + 0{M ^] 



\il,«2G[0,l] 



(6.26) 



p{ui)q{u2) duidu2 + 0{M~^) 
,«2e[o,i] J 

Since the functions 1 and Ui + U2 are linearly independent, there exists Cq > such that, 
for M large enough, the quadratic form is bounded below by co(-2^ + y"^). So 

\f{x,y)\^ < 1 - Co{z' + y') =^ \f{x,y)\ < e-«(^^+^^)/2, (6.27) 

if \z\ < zo,y e (-7r/2,7r/2], M > M{zo). Putting (jOn|l and (jFTTfj) together enables us 
to conclude that ()6.27|) . with a possibly smaller cq, holds for all z,y with \z\ < B and 
\y\ < 7i/2. Note however, that cq now depends on B and goes to zero like B~^ as i? — > 00. 
Making this dependence explicit, we have that 

|/(a:,y)|<e-'=o(^^+.^)/^ 



whenever M is large enough, \z\ < B and \y\ < 7r/2. 
Finally, for \z\, \y\ both small 

1/ J 



^±^i.WM)+y) ^ 1 ± ^(^A + y) - ^(.^ + yf + 0(1^1^ + 



(6.28) 



(6.29) 



So 



1 1 



1 1 



1 + ^anz + z(6„ + s/n)y - -— Y.^z\^f + 2zy^ + y^) + 0(|z|3 + \y\ 



M' 



M 



(6.30) 

where, by the definitions p{u) = P(Cm + v) Q'(^) = 1 ~ -P(C'W + ^)) ^-^id equations ()3.8|] 
and (Ol), 



M 



M' 



u{p{u) - q{u)) du + 0{M-^) = 0{M-^) (6.31) 
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s 
n 



{p{u) - q{u)) du-h + 0{M-^) = 0{M-^). 



(6.32) 



Exponentiating the expansion for /, we arrive at 

/(x, y) = exp (^izttn + iy (^^ + 6„ j j 



xexp{--{z,y)Q{z,yy + 0{\zf+\yf: 



(6.33) 



where 



Qu =E[(X/M)2] + 0(M-2) E[(X/M)2], 
Qi2 =E(X/M) + 0(6M-i) ^ E(X/M), 
Q22=l-(& + 0(M-i))2^1-62. 

Note that the matrix Q is positive definite in the hmit since 

1 1-462 



(6.34) 



E[(X/M)'] (1 - b^) - E^{X/M) ^(1 - &') - ^ 



12 



> 0, 



since 6 < 1/2. By (OH]) and (011) 



e -^--^"(x, y) = exp ( z^(S„ + - ^(z, y) Q{z, y)' + 0(n(|z|^ + \y\')) 



n 



(6.35) 



(6.36) 



where a„ = na„ — (i/M and 6„ = 

Let us derive an asymptotic formula for E(Z„(£, s)), using ()6.1H) . ()6.28|) . and ()6.33|) - 
fOHll . By dnH) and (l?nT|l . the contribution of the (x, y) with |x| > fi/M to E(Z„(£,s)) 
is of order 0((c/5)"). Let |x| < 5/M. Switch io z = Mx,y = y. By the 
contribution of the (2;,?/)'s with ||(2;,?/)|| > r„ := n^^/^logn is of order 

Af-i y re-"^o-'/^cir = M"ie-®(^"i°^'"). (6.37) 

r>r„ 

By (I6.33j) - (j6.36|) . the contribution of the (-2, l/)'s with ||(2;,y)|| < r„ is asymptotic to 
exp (^izdn + iybn - ^{z,y)Q{y, z)'^ (^1 + 0{n\z\^ + n\y\^)^ dzdy 



2M7r2 

ll(^.j/)ll<^^ 
1 



2Mn^ 



n 



exp ( iza„ + iyb^ - -{z,y)Q{z, y)' j dzdy + 0^^^ 



1 



n 



-1/2 



. (6.38) 



\\{z,y)\\<rn 

The first term on the r.h.s. of 



is equal to 



1 



MnvrVdet Q 



exp 



-I 

2n' 



^+0( j te-'''^dt) 



i>e{log n) 



(6.39) 



Constrained Integer Partitions (DRAFT, February 24, 2003) 



37 



where the exponential term is 

1 + 0{n-\al + hi)) = 1 + 0{n/M^) + 0{n-\i/M)^ = 1 + 0(e-®(")) + 0{n-\i/M)^), 

(6.40) 

while the integral is of order e"®*-^"^ Combining the last formula with the estimates of 
contributions of other ranges of {x,y), we obtain 

E(ZM, s)) = (1 + 0(^-1/2) + 0((i/M)^n-'))-^^ ^ 

TrVl - Mn 

whenever \i\ < in = o{n^^'^M). Summing over £ G { — {in — 1), • • • ,C — 1}, the bound 
()fi.4Hl clearly implies a similar bound for Zn{s), namely 

nZn{s)) = (1 + 0(1))^^%^. (6.42) 

By the bounds ()6.4ip and ()6.42|) and Markov's inequality, we have that Z„(£, s) < 
j^^-igiog2„ ^^^^^^ ^ £„M-ie^°g'" with probability 1 - 0(e-'°s'"). Combined with 
and dnSl) this implies dnH) and (jHIS)- ■ 

6.2 A Digression: The Expected Number of Perfect Partitions 

The reader may have noticed how gingerly we have tiptoed around the first factor in ()5.ip . 
concentrating instead on the asymptotic behavior of the double integral. To see why, we 
will show that E(Z„(£, s)) remains exponentially large above the curve k = Kc{b), in sharp 
contrast to the fact that, in this domain, w.h.p. there are no perfect partitions at all. 

Theorem 6.2 Let \i\ < 1. For s > 0, s + n = O(mod 2), andb = s/n hounded away from 
and 1, 

M 2 / n \ e^'<p'^{X) ,,,, sinhA , , , 



(6.44) 



where U\ G [—1, 1] has density 



Iyei-i,i]^''/'^dy' 
and \ < is such that E([/a) = —b, or explicitly: 



X 

Consequently, 



cothA-^ = -6. (6.45) 



where 



lim -\ogE[Zn{i,s)] = R{K,b), (6.46) 

n^oo n 

i\ 1 + ^, 2 2 , sinhA , ^ 

R{k, b) = ^- log + ^- log ^— ^ + \b + log — K log 2. (6.47) 
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Proof of Theorem 16.21 In order to calculate the expectation of Zn{i,s), we use the 
following analogue of ()3.6|) , which is valid whether or not i has the same parity as Y2j • 



2" 



27r2 



-i{£x+sy) 



Y[ cos {xXj + y) dxdy. (6.48) 



a;£ { — 71 ,7r] 
i/ee(-7r/2,7r/2] 



This gives 



E(Z„(£,s)) 



e-i(^^+'y)E''[cos{xX + y)) dxdy, 



(6.49) 



where 



E{cos{xX + y)) =Re 



( — 7r,7r] 
!/eG(-7r/2,7r/2] 



M 



Re 



j^^-lgj(x+?;) 



sm 



2 



Msin ( f 



cos 



(M+ l)x 



(6.50) 



+ 1/ • 



Notice that the first factor is a function of x only, and the argument of the cosine is y 
shifted by x{M + l)/2. Integrating first with respect to y, we can choose y G (— vr/2 + 
x{M + l)/2, 7r/2 + x{M + l)/2], thus reducing the y-integral to 



TT 



e "-"^ cos^ ydy. 



(6.51) 



j/e(-7r/2,7r/2] 



Here we can write cosy = E(e*^'^), P(cr = ±1) = 1/2. So, introducing the independent 
copies (Ti, . . . , cr„ of 0", we write the last integral as 



1 / n 

2" 



n+s 
2 



Therefore we get the simpler expression 

E(Z„(£, s)) 



n \ 1 
2^ 



2 



sm 



Mx 



t-=-i + s- 



M+1 



Msin I 



dx] 



(6.52) 



(6.53) 



(6.54) 



To simplify the rest of the exposition, let us assume that sinx/2 in denominator can 
be replaced asymptotically by x/2. (A refined version of the argument shows that this 
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replacement is asymptotically correct for n = o{M). We leave it to the interested reader 
to check this.) Substituting z = Mx/2, we get then 

2e{-M7r/2,M7r/2] yv.OO) 

T ■.=s{l + M-^) -2i/M, 

so that T is very close to s, since \i\ < 1. Next, we observe that siny is the character- 
istic function of the random variable U, uniformly distributed on [—1, 1]. Hence (^^) 
is the characteristic function of S'„ = Yl]j=i Uj, where Uj are the independent copies of U . 
In particular, using the Fourier inversion formula for fs^-, the density of S'„, 

fsA-r) = ^ I e-i'^Xdz, (6.56) 



27r 

zS(— 00,00) 



we get from ()6.55|) : 

2 / n 



E(Z„(£,.))~-(„^J/5„(r). (6.57) 



(The error caused by extending the integration interval in ()6.55|) to (—00, 00) is extremely 
small, of order M~" = 2"'^" .) We cannot use the local limit theorem for /s'„(— t) directly 
as E^n = liEU 7^ — r. So we introduce an auxiliary random variable U\ with density 
proportional to e'^^ on [—1,1], choosing A such that E([/a) = — r/n, that is setting A 
equal the root of 

xe[-i,i] 

Then 

fsA-r) = e-^(--)0"(A)/5„,,(-r), (6.59) 

where Sn,x = X]j'=i ^i,A- By the local limit theorem (see, e.g., |m. Sect. 5.10), the density 
/s„,a(-^) of Sn,x is asymptotic to 

1 



\/2TTnVaiUx 

where Va.T{U\) can be explicitly calculated, giving 



(6.60) 



Var(^,) = ^ - (r/ny. (6.61) 



Thus 



as claimed. ■ 
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6.3 Optimal Imperfect Partitions 

In Theorem 16. II we proved that, for k > K,c{b) and b < be, w.h.p. the minimum discrepancy 
is at least 2"('^~'^'=(^))~'^p("^^^\ The next theorem provides a complementary upper bound 
for the minimum discrepancy. 

Theorem 6.3 Suppose that lira sup \s\/n < b and that < liminf k, < limsup k < oo. Let 

e > 0, let Sn be the random variable defined in ()5.8|) . and let in be a sequence of positive 
integers with in = oiMn^^"^ / logn) and in > 2^'^~'^^^''^^^^^ . Then there is a constant 5 > 
such that, with probability 1 — 0{e~^^°^ "'), 

J2 ^n(^,s) =C,x2["^(')-"l"e"'^'^"+°("''"), (6.63) 

where in,x. is the number of integers i with \i\ < in with the same parity as With 
probability 1 — 0(6^*^'°^^"), we thus have 

c^opt < r2("-"-(^)+^)"], (6.64) 

implying in particular that M^^dopt is exponentially small in n. 

Note that for limsup(K — K_{b)) < 0, we recover that dopt < 1- Not astonishingly, the 
proof of Theorem 16.31 follows closely the proof of Theorem 15.11 which established that, 
w.h.p., there are exponentially many perfect partitions for limsup(K — /«-(&)) < 0. 

Proof of Theorem 16.31 Let us first note that depending on the parity of Y^jXj, the 
number £„^x is either in or — 1, implying in particular that £„^x > in — ^- Using 
this fact, it is not hard to see that the bound ()6.64|) follows from ()6.63|) . Indeed, let 
4 = ^2(^-K.+e)n^ ^ i_ gj^pg ^„x > 4 - 1 > 2^'^-'^-+^)", the right hand side of goes 
to infinity, implying that dopt ^in — ^- It is therefore enough to prove ()6.63|) . Also, since 
both sides of ()6.63|) are identically zero if in,x = 0, it is enough to consider £„ x > 0. 
Analogously to the proof of Theorem 16.11 we set 

Zn{s)= J2 Zn{i,s), (6.65) 

where Zn{i, s) is the random integral defined in ()6.6|1 . Let us note, however, that Z„(£, s) 
is equal to the random integral defined in ()5.2|) if i is of the same parity as 'Y^- Xj , and 
equal to zero otherwise. The sum in ()6.65|) can therefore be replaced by the sum over all 
i with the same parity as Yj -^jy 



a:6{-vr/2,7r/21 l^-W<f^n 
ye{-TT/2,n/2] 



e 



e-''yf{x,y;X)dxdy, (6.66) 



where the sum Yl' indicates the sum over all i with the same parity as Y^j ^j- Obviously, 
we have 



< in,x (6.67) 
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and 



J2 e~''" = ^n,x(l + 0((C|a;|)2)) as W 0. (6.68) 

t\e\<i„ 

Recalling that, depending on the parity of ^^e number £n,x. of terms in the sum 

Y^' is either or — 1, we also have 



-i2kx 



k=0 



< 1 



sin((£„ - l)a;) 



smx 



< 



(6.69) 



As in the proof of Theorem 15.11 we split the right hand side of (j6.66|) into two parts, 
Jni(X) and J„2(X), for |a;| < B/M and |a;| > B/M respectively. We start with Jni(X). 
As in fj5.20|) . we restrict ourselves to the event An defined in (jS.lSj) . With the help of 
(I6.67|) . we then bound the contributions of all x with ||(Mx, y)\\ > r„ = n~^^'^ logn by 



JJ \f{x,y-iq\dxdy <^-^ jj e-^^\\^\\'dz,dz^ = o(^ 



-n,X ^_C4(log2„) 



Mn 



||(M3:,i;)||>r„ 
\Mx\<B 



|]z|| >log n 



(6.70) 

since i„\x\ < M~^£r,r„ = ofrnn^/^ log~^ n) — > 0. 



For ||(Mx, ?/)|| < Vn, we may use 
And, analogously to ()5.32|) . we have that with probability at least 1 — 0{e 



-Ss log^ n 



1 



-i£x 



\\{Mx,y)\\<rr, ^^I^K^' 



e '^''^f{x,y;X)dxdy 



Mn 



(6.71) 



Comparing ()6.70|) and ()6.71|) (and using again the bound ()5.28p . this time to get a lower 
bound on the right hand side of ()6.71|l ). we see that the last expression is a sharp asymp- 
totic formula for J„i(X). 

Next, we use (16.691) to bound 



E(|J„2(X)|2) < 




(l + 5(xi))(l + S(x2))|F(x,y)rrfxt/y, (6.72) 



i/l,H2S(-'r/2,7r/2] 



where F(x, y) is defined in fj5.39p and 
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Arguing as in the case of J„2(X), we thus obtain that E(| J„2(X)p) is of order 



ic/BY 



( 

1 + 

\ xG(-7r/2,7r/2] 



sin((£„ - l)a;) 



dx 



sm X 



+ (P + c/BT 



1 + 



sin((4 - 
I sinfxi)! 



1 + 



sin((C - 1)3^2)1 
I sin(x2)| 



dxidx2, (6.74) 



a:i,a;26(-T/2,7r/2] 
\xi+X2\<B/M 



with, as before, p = 2 . Here the single integral is of order 



sm{{in - l)x) 



■ dx 



\x\ 



sin if: I 



dt = O {login) = 0{n), (6.75) 



a;e{-7r/2,7r/2] |t|<7r(£„-l)/2 

and the double integral is of order 



B 
M 



1 + 



sin((£„ - l)x)| 



dx = 0(M-^4 log £„) = 0{ninM-^). (6.76) 

\ I siin^x ) I / 

xe(-7r/2,7r/2] 

Combining the contributions from ()6.75|) and ()6.76|) . we get that for B sufficiently large 
E( I J„2 (X) n = O (n^ (c/5)") + O (nCM-i (p + c/5)") 



O 



Mn 



o 



(( 



Mn 



<e/2)n 



(6.77) 



Here, in the second step we used that p is bounded away from and 1, and that B is 
large enough, while in the third step, we used that injM is assumed to be at least p^e*^". 
By Markov's inequality (and the fact that < ^„_x + 1 < 2£„_x whenever £„^x 7^ 0), we 
conclude that with probability 1 — 0(e~*-^''^-'"), 



Jn2fX)| < 



-n_^-(sl9>)n 



Mn 



- Mn 



(6.78) 



By dnnni), and ^f^ . J„i(X) is of order at least %fe~^°s'" with probability 

g-<54iog n some (^4 > 0, much larger than the above order of |J„2(X)|. We thus have 
shown that there is a constant 5 = 5(5) > such that with probability at least 1 — 

0(e-'^'°s'"), 



Z„(s) = (l+o(l)) 



?n,X 1 



Mn vrVdeti? 
Together with ()6.8j) . this implies the bound 



exp(-T„(4/?)~V;). 



(6.79) 



The following corollary follows immediately from Remark 16.11 (i) and Theorem 16.31 

Corollary 6.1 Assume that limsup \s\/n < h, limsupK < 00 and liminf[K — Hci^)] > 0, 
and let e > 0. Then there exists a constant 6 > such that with probability 1 — 0(e~'''°^ "), 



2^[K—Kc{b)—n ^/"^ logn]n <^ ^ ^ c2[n—n-{b)+£\n 



(6.80) 
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As argued in Remark I6.in ii). we expect that for liminf k, > Kc, the number of opti- 
mal partitions Z^pt grows subexponentially. But so far, we only can prove the following 
corollary to Theorem 16.31 

Corollary 6.2 Suppose that lim sup | s | /n < b and that < liminf k < limsup/t < oo. 

Then for every e > 0, there exists a constant 6 > such that 

Zopt < 2^'^=+^)'^ max{2-''", 2"'^-"} (6.81) 
with probability 1 — 0(e^^^"^^"'). 

Proof. Let 4 = [2['"-'"-('')+^/2]n^ ^ i_ gy ^Yie bound KM\ of Theorem lO we have that 
there exists some 6 > such that dopt < — 1 with probability 1 — 0{e~^^°^ Using 
the bound ()6.63|) . we therefore get 

Zopt < £^^2'''''*^*^~'^l"e"^''^'^"^°''"^''^-' < £^2^'^''^^)"'^]"-e^^"'^^^^°^"'\ (6.82) 

again with probability 1 — o(e^^'°s^"). Bounding < 3max{l, 2['^~'^-(^)+'^/2l"j.^ we obtain 
the bound ^W^ . ■ 



7 The Sorted Phase 

It remains to study the minimum discrepancy for b > be 



7.1 Characteristics of the Sorted Phase 

Theorem 7.1 Suppose that M ^ n'^ and liminf s/ra > be- Then w.h.p. the optimal 
partition cr* is the sorted partition obtained as follows. Order Xj in the increasing order, 
so that < ■ ■ ■ < -^7r{n) for some permutation n of {1, . . . ,n} . W.h.p. there will be no 
ties, and the ordering it will be uniquely defined. Denoting j„ = (n + s)/2 = n(l + b)/2, 
b = s/n, 

f 1, 1 < ?■ < in, 

[ - 1, jn< j < n, 

and w.h.p. the minimum discrepancy is asymptotic to "^[(1 + b^ — 2], i.e. of order Mn. 
Proof of Theorem 17.11 We begin with the following observation. If 

jn n 

then the sorted partition is optimal, and dopt = '^s(X). Indeed, let cr be any feasible 
partition cr. Then s(cr) = |{j : = 1}\ — \{j : aj = — 1}| = 2|{j : aj = 1}\ — n so that 
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\{j ■ (^3 = 1}I =in- Thus 



in n 



(7.3) 
(7.4) 



which imphes optimaUty of the partition cr* and dgpt = Ss(X.). 

In hght of this property, all we need to do is to show that, for b > be and bounded 
away from be, w.h.p. 



Mn 



[l + bf 



;i + o(l))>0. 



(7.5) 



Let U be [0, l]-uniform, and Ui, . . . ,Un be the independent copies of U. Then the 
sequence {X'-} :— {[MC/^]} has the same distribution as our sequence {Xj . 1 < j < n}. 
So we will consider {Xj} instead. Since M ^ n^, it follows easily that w.h.p. X'^^^-^ < 
■ ■ ■ < if and only if ^4(1) < • • • < Furthermore 



< \MUj] - MUj < 1, 



(7.6) 



so it suffices to show that w.h.p. 



Jn 



J^U.iJ)- E ^.(.•) = 4[(l + ^)'-2j(l + o(l)). 



(7.7) 



A second simplification is based on a fact that the sequence {C^7r(j)}i<j<n has the same 
distribution as {-^^}i<j<n, where Sj = X]i=i ^k, and Zi, . . . , Zn are independent copies 
of Z, the Exponential (A), A > being arbitrary. Choose A = 1 for certainty. Since Sn+i 
is w.h.p. asymptotic to (n + 1)EZ = n + 1, it suffices to show that w.h.p. 



i=l j=jn+l 

or, in terms of Zts, that w.h.p. 

El — E2 

where 



n 
T 



[l + bf -2 (1 + 0(1)), 



n 

T 



Sl =5]] (in -k + l)Zk, 



k=l 



]-n 



S2 =(n - jn) E ^'^^ + ^ (n - /c + l)Zk. 



k=l 



(7.8) 



(7.9) 



(7.10) 
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Finally, introduce {Zj^}, the truncated version of {Zk}, namely 

= min(Zfc,21ogn), l<k<n. (7.11) 

Noticing that 

F{3k<n: Z'^^ Zk) <nF{Z[ ^ Z^) (7.12) 

=nP(Zi > 21ogr2) = n"^ ^ 0, (7.13) 

we can and will replace Z^, by Z'^ in ()7.10|) . denoting the corresponding sums by S'^. 
Observe that 

E(Z^) = E(Zfe) - y" -2\ogn)e-ydy = 1 - n'^, (7.14) 

2 log n 

SO that 

E(SO =E(S,) +0(1), ^ = 1,2. (7.15) 

By the Azuma-Hoeffding inequality (see, e.g., ^T], Section 12.2) and Z'^ < 21og?2, we 
have: for every a > 0, 

P(|S' - E(S')| > a) < 2exp ( = 2exp ( ■ (7.16) 

^' * ^ 2n(2nlogn)2y ^\ SnHog" n J ^ ' 

Using this bound with a = n''/^, we obtain that w.h.p. 

|S:-E(S:)|<r^^/^«n^ (7.17) 

implying that w.h.p. 

Si - S2 = S; - S2 = E(S;) - E(S2) + 0(n^/^). (7.18) 

It remains to observe that 

E(S; - E'2) =E(Si) - E(S2) + 0(1) 

JnUn - 1) _ _ ^.^^^.^ _ (--in)(n-J„ + l) ^ ^^^^ 



+ 0(n) 



2 

= Y[(l + &)'-2] + 0(n), 
which completes the proof. ■ 

Remark 7.1 Consider a point {k, b) such that b < be- From the above proof, it is easy to 
show that w.h.p. the sorted partition cr* cannot be optimal. Indeed, by fj7.19p . we see that 

E(S; - S'2) = ^ [(1 + b)^ - 2] + 0{n) < -Q{n^). (7.20) 
This means that w.h.p. 

<T*-X< -e(Mn) ^ |<T*-X| > 0(Mn), (7.21) 
and we know that dopt = o{Mn) for every {k, b), with b < be- 
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7.2 The Dual Role of h 



In Theorem 14.11 we proved that be is the threshold of the values of b for solvability of 
the saddle point equations ()3.18|) . Then, in Theorem 17.11 we proved that the same be is 
also the threshold for optimality of the sorted partition. At this point, while the results 
are complete, the equality of these two thresholds seems to be nothing but a numerical 
coincidence. In this subsection, we give an explanation of this coincidence. It turns out 
that the coincidence reflects a deterministic property of the integer partitioning problem 
which holds for a broad set of X, see Theorem 17.21 below. 

We caution the reader that Theorem l7.2l does not directly imply the two Theorems 14. II 
and 17.11 Theorem 17.21 holds only for a given instance X, and states that for s/n greater 
than some bc(X.), the sorted partition is optimal, while for s/n < bc(X.), the saddle point 
equations (jH.lHj) have an unique solution. In order to apply this theorem to the random 
integer partitioning problem, we would have to address two issues. First, we would have 
to show that if 6 > be, then w.h.p., b > 6c(X). This follows in a relatively straightforward 
fashion using the techniques of the last subsection. The more difficult issue is to relate 
existence of solutions of the saddle point equations ()3.13|) to that of the averaged saddle 
point equations ()3.18|1 . This requires that we establish existence and commutation of the 
limits n ^ oo and s/n ^ b, and deal with the fact that nbc is generally not an integer 
so that, as b ^ be, the solution to the saddle point equations gives a cr with some aj in 
the interval [— 1,+1] rather than all aj G { — 1,+1}, see below. Since the coincidence is 
already proved in Theorems 14.11 and 17.11 and since the purpose of this section is simply 
to elucidate the coincidence, we do not deal with these, admittedly difficult, issues here. 
We just present the result for a given X. 

Consider n arbitrary numbers Xi, . . . , X„ G {1, . . . , M}, subject to the constraints 
that no two of them are equal, and that their sum is even (the odd case is similar and 
is left to the reader). Without loss of generality, further assume that the are ordered 
in increasing order, so that Xi < X2 < ■ ■ ■ < X„. Consider the equations ()3.13|) with 
i = 0, and define Sc(X) as the supremum over all s for which the equations ()3.13j) have a 
solution. 

Let cr be a fractional partition, i.e., let cr G [—1,1]". We say that cr has bias s, if 
Yl^=i = "5; ^iid we say that it is sorted, if o"j < (Xj+i for all i and |cTj| = 1 for all but 
at most one i. Note that there is exactly one sorted partition with bias s for any real 
s G [—n, n] . We finally introduce the critical sorted partition as the sorted (fractional) 
partition <t that obeys the condition 



There is at most one such partition for a fixed set of weights Xi, . . . , X„. With slight abuse 
of notation, we say that a probability distribution P(cr) on partitions is concentrated on 
a fractional sorted partition <t if P((Tj = 5"j) = 1 whenever \ai\ = 1, and P((jj = 1) = p 
when 5"j takes the fractional value 2p — 1. 

The following theorem shows that the critical bias for the existence of a solution to the 
random saddle point equations and the critical bias for the optimality of the fractional 



n 




(7.22) 
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sorted partition are identical. For brevity, we drop the term "fractional" throughout. 

Theorem 7.2 Let Xi < X2 < ■ ■ ■ < X„ G {1, . . . , M}, and assume that the sum Yl^i=i -^i 
is even. 

i) If £ = and < s < Sc(X), then the saddle point equations ()3.13|) have a unique 
solution {C,,ri) with —00 <r7<0, 0<,^<oo and |?7|/^ < M. 

a) For i = 0, s < Sc(X), and a solution of the saddle point equations ()3.13p . 

let Ps(-) be the probability distribution on partitions defined in ()3.7p . If s Sc(X), then 
^ y 00, rj \ —00, and the distribution Ps(-) gets concentrated on the critical sorted 
partition. 

Hi) If i = and s > Sc(X), then the saddle point equations ()3.13j) have no solution, 
and the sorted partition with bias s is optimal; if s > Sc(X), this partition has non-zero 
discrepancy, implying that there are no perfect partitions with bias s > Sc(X). 

Remark 7.2 Statement ii) clearly implies that the critical sorted partition has bias Sc(X). 
As a consequence, a sorted partition with bias s > Sc(X) has non-zero discrepancy. By 
an easy extension of the argument given for non-fractional partitions, this in turn implies 
that sorted partitions with bias s > Sc(X) are optimal. Except for the statement that the 
saddle point equations ()3.13p have no solution for i = and s = Sc(X), statement Hi) is 
therefore an immediate consequence of statement ii). 

Proof of Theorem OJ i) Given Xi < X2 < ■ ■ ■ < X„ G {1, . . . , M}, let 

n 

F(e,r^) = J]X,tanh(eX,+r^), 

(7.23) 

G(e,r/) = X^tanh(eX,+r7). 
j=i 

Since the partial derivatives dF{^,ri)/dC, and dF^^^rj) /drj are strictly positive for all 
(^,77) G M^, the equation 

F{iMO) = ^ (7.24) 

has a well defined, unique solution rj{^) for all ^ G M, and rj{^) is strictly decreasing on 
M. Let 

9{0=G{^M0)- (7.25) 

Each solution {i,rj) of the saddle point equations ()3.13p is then a solution of g{S,) = —s 
and 77 = r]{^), and vice versa. Using the fact that the derivatives of F and G are second 
order derivatives of the strictly convex function Ln{^,r]), one easily shows that g{-) is 
strictly decreasing. Combined with the fact that 77(0) = so that g{0) = 0, we easily 
complete that proof of i). Indeed, by the monotonicity of g, the equation g{^) = —s has 
a unique solution ^ G (0, 00) whenever 



^(0)=0<s<s, = - lim g{^). 



(7.26) 
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But ^ > implies ?7 = r7(^) < 0, so we are just left with the proof of the inequality 
|?7| < ^M. To this end, we just observe that F{C,,ri) = 0, ^ > and the fact that not all 
Xj G {1, . . . , M} are equal imply that 

n n 

= ^ Xj- tanh(^Xj- + r])< tanh(eM + r])^^ Xj, (7.27) 

i=i i=i 

which in turn gives + 77 > 0, as desired. 

ii) By the strict monotonicity of g, ^ 00 as s y Sc (otherwise, the equation g{C,) = 
—Sc would have a finite solution ^ < 00, which contradicts the strict monotonicity of g on 
(0,00)). If ?7 = ?7(^) stayed bounded away from —00 as ^ 00, the function F{C,,ri{^)) 
would converge to the sum of the weights Xj, which is not compatible with -F(^, //(O) = 0. 
Thus T] \ —00 as s y Sc- We now set cTj{^) = tanh(^Xj + ri{^)), so that 

n 

F{^,r^{0) = J2^M0- (7.28) 

i=i 

In order to complete the proof of ii), we have to show that (t[C,) converges to the critical 
sorted partition as ^ — > cx). 
To this end, we first note 

|l-k,(OII<2e-«, (7.29) 

for all but at most one j. Indeed, let X(^) = — r7(,^)/^, so that crj(,^) = tanh(^(Xj— X(^))). 
Let jo be such that \Xjg —X{^)\ is minimal. Since two consecutive weights Xj, Xj+i differ 
by at least 1, we conclude that \Xj — X(^)| > 1/2 for all j 7^ jo- Together with the bound 
II tanhxl — 1| < 2e~^'^| this proves ()7.29|) for j 7^ jo. 

Consider now a sequence {^r) with — cxo as r ^ 00. By compactness, we can 
always find a subsequence such that cr(^r) converges to some cr. Due to ()7.29|1 . we must 
have that \aj\ = 1 for all but at most one j. Since F{^r,v{^r)) = J2j ^ji^r)^j = 
and Cii^) < ■■■ < o-niO^ the same holds for the limiting sequence 5"i,...,5"„. As a 
consequence, cr is the critical sorted partition defined in ()7.22|) (recall that the critical 
sorted partition is unique). Thus any convergent subsequence of cr{^r) converges to the 
critical sorted partition cr, implying that cr[C,r) itself converges to ct. This concludes the 
proof of ii). 

iii) We already showed above that the equation g{^) = — Sc has no finite solution 
< 00. As pointed out in the remark following the theorem, this is the only statement 

in iii) which does not follow directly from the statements in ii). 

8 Relaxed Version of the Integer Partitioning Prob- 
lem 

It is a rather common idea to approximate an optimization problem defined with integer- 
valued variables by its relaxed version, where the variables are now allowed to assume any 
value within the real intervals whose endpoints are the admissible values of the original 
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integer variables. In our case, the relaxed version is a linear programming problem (LPP) 
which can be stated as follows. Find the minimum value dopt of d, subject to linear 
constraints 

-d<Y^ ajXj, ^ ajX^ < d, 

J2a, = s, ' (8.1) 

j 

-l<aj<l, (l<j<n). 

As usual, the LPP has at least one basis solution, i.e. a solution {cr,dopt), which is an 
extreme (vertex) point of the polyhedron defined by the constraints ()8.1|) . Let N{cr) : = 
\{j : (Tj G (—1, 1)}| be the number of components of cr which are non-integer. It is easy 
for the reader to verify that N{cr) < 2 for all basis solutions cr. In fact, N{cr) cannot be 
1 either, since in this case the exceptional aj^ ^ ±1 must be zero, which contradicts to 
s = n(mod 2). Thus, for a basis solution, N{cr) G {0,2}. N{(t) = signals that cr is an 
optimal partition. Suppose N{a) = 2, and let {ji,j2} = {j '■ o-j G (—1,1)}. Using the 
second line in (jH.lj) . we see that 

(Tn+^n^i-^M- (8.2) 

Moreover, the second condition in ()8.H) . combined with s = ri(mod 2), rules out the values 
±1. Therefore, {o'j}j-^jj^j2 is a partition of {Xj}jy^ji,j2^ of bias s. 

Our last theorem shows that the horizontal line b = be is a phase boundary for the LPP 
as well. For b > b^ the solutions of the initial partition problem and of its LPP version 
coincide. For b < be they are very far apart, in terms of the ratio of respective optimal 
discrepancies. To state this precisely, we introduce the fraction of basis solutions cr with 
a property that the deletion of the N{cr) components of cr with values in (—1, 1) produces 
an optimal integer partition for the remaining weights Xj. We denote this fraction by 

Fn{K,b). 

Theorem 8.1 Let limsup k < oo and < liminf k. 

(i) //liminf s/n > be, then w.h.p. the sorted partition cr* is an unique solution of the 
LLP (jHIH), so that in particular Fn{K, b) = 1 and dopt = 6(Mn). 

Let limsup s/n < be- 

(a) w.h.p. dopt = 0, and there are 2'^':(^)"+'^('^^''^ basis solutions (cr,0). 

(Hi) //liminf (k — /tc(6)) > 0, all basis solutions are fractional (i.e. have N{cr) = 2), and 
i/limsup(/« — K-{b)) < 0, the number of basis solutions with N{cr) = is at most 

2(Kc(6)-K)n+Op(ni/2) 

(iv) IfO<e< K^{b) and liminf (k - k_(6)) > 0, then, w.h.p., 2-«-W"-^" < F„(k,6) < 
2-K^{b)n+en^ //limsup(K - K^{b)) < 0, thcn, w.h.p., Fn{K,h) = 2-'^"+o("'/'iog"). 
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Remark 8.1 (i) As discussed in Remark \6.1\ (Hi), we believe that the number of optimal 
partitions above grows subexponentially in n. Let us assume such a bound, more pre- 
cisely, assume that for e > 0, for b < b^, and for k with liminf(fi: — Hcip)) > 0, we have 
that the number of optimal partitions is bounded by 2'^"', with probability at least 1 — o(n~^). 
Under this assumption, the proof of Theorem \8.1\ can be easily generalized to show that, 
w.h.p., 2-'^'=(^)""^" < Fn{K,b) < 2~'^'=(^)'^+^" whenever b < be and liminf(K - Kc(6)) > 0. 

(a) If, on the other hand, one believes that the asymptotics of Theorem 15. 1\ hold up 
to He, more precisely, if we assume that the bound (j5.52j) holds with probability at least 
1 — o(n~^), whenever b < be and limsup(K — K,e{b)) < 0, then one can prove that, w.h.p., 
F„,(k, b) = 2~™+°(")) for all {b, k) with b < be and limsup(/€ - Ke{b)) < 0. 

Proof of Theorem 18.11 and Remark 18. IL (i) Suppose liminfs/n > be- Then 
w.h.p. J2j '^j-^T^U) = 6(Mn) > 0, where cr* is the sorted partition, so that < 



< and 



. -, J < 3n, 

' - 1, j > Jn- 



Let cr G [—1, 1]" be an arbitrary relaxed partition with <t ■ e = <t* ■ e. The proof of (1) 
then reduces to the proof of the statement that 



ajX^(j) > ^ Or;X^(^) if (T^CT*. (8.4) 

j j 

If cr 7^ cr*, then there exists a pair of indices i < j such that 0"j < 1 and aj > —1. Assume 
that i is the first such index, and that j is the last such index. Since X^(j) < X7r(j), we can 
strictly lower the value of (jjX7r(j) by raising cij and lowering aj, and preserving ai + aj, 
until at least one of them has absolute value 1, i.e. either (jj = a* or aj = cr*. Repeating 
this procedure with the lowest i and the largest j such that (jj < 1 and aj > —1 in the 
new configuration, we will eventually arrive at the configuration cr*. Since the value of 

crjX7r(j) was strictly lowered in each step, this proves ()8.4|) . and thus statement (i). 

(ii) Let limsup s/n < be, and consider the partitioning problem for Let a > 0, 

and set = \Mn~°-~\. By Theorem 16.31 we have that with probability 1 — 0{e~^^°^ "), 
there are at least 2"'''=~'^("^^^^°sn) tuples {crj}j>s such that 



a 



jX,\<i^. (8.5) 



i>3 

We will denote the event expressed in ()8.5|1 by A. On the other hand, introducing the 
event B = {\Xi - X2I > Mn~'^^}, Oi G (0, a), we have 



M2 

\i-j\<Mn~'^l ^ ' 

=l-0(n-"^). 

Introducing C = ^4 fl i3, we have then that 1 — P(C) 0. On C, we can define 0\ = —02 
where 

ai = ^'^ = 0(n-("--)) G [-1, 1]. (8.7) 
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Clearly Yl]=i'^j-^j = 0; denoting cr = {o'j}i<j<n, we have that (cr,0) is a basis 
solution of the LPP. Thus w.h.p. the LPP has at least 2"''':~'^("^''^^°§") basis solutions. 
Conversely, suppose cr is a basis solution, with cti, o"2 G (—1, 1), and ai + (T2 = 0. Then 



j>3 



< |Xi - X2I < M = o(Mn^/2 iQg-i 



n] 



Again by Theorem 16. 31 we know that with probability 1 — 0(6"*^'°^^"), the total number of 
all such (n — 2)-tuples with bias s is 2"''*<=+'^("^^^^°s"), and for each such tuple, the feasible 
values of ai, 02 are determined uniquely. Since there are (2) ways to select ji, j2 as indices 
of components Oj G (—1, 1), and since the union of (2) events happening with probability 
1 - 0(e-^'°§'") happens with probability 1 - 0{n'^e-^^"^'' we get that 2"''-+'^('^''"i°g") 
is also w.h.p. an upper bound on the number of basis solutions cr with N{a) = 2. 

If N{(t) = then, since b < b^., cr is a perfect partition, and thus of zero discrep- 
ancy. The number of such partitions can clearly be bounded by the number of partitions 
with discrepancy smaller than in = M. By Theorem 16. 3[ this is in turn bounded by 
2nKc+0(ni/2iogn)^ Completing the proof that the number of basis solutions with iV(cr) = 
or A^(cr) = 2 is 2'*'"'=+'^("''''i°s"). 

(iii) If liminf(fi; — Kc(&)) > or limsup(K — < 0, we know a little bit more. 
In the first case, w.h.p. there are no perfect partitions, and thus no basis solutions with 
N{(t) = 0. In the latter case, w.h.p. the number of perfect partitions of zero discrepancy 
(and thus the number of basis solutions with A^(cr) = 0) is at most 2^'^''^''^~'^^^~^^p^^^^^\ 
which is negligible relative to 2'^-W"+o("'^' i°g"). 

(iv) If £ > and liminff/t — k_ (b)) > 0, then with probability 1 - 0{n^e-^^°^ for 
every subset of (^ — 2) weights Xj, there are at most 2^'*'="'*-+^)" optimal integer partitions 
by Corollarv 16.21 As in (ii), once the ±1 values of the corresponding aj are known, the 
values of the two remaining aj are determined uniquely. So w.h.p. the LPP may have 
at most (2)2^''^'-''-+^^'' basis solut ions with N{cr) = 2 that have the optimal subpartition 
property. Similarly, the number of cr with N((t) = is at most 2*^'^="'^-+^)". So for every 
e, w.h.p. 

F„(/s:,6) < 2-''-(^)"+2^". (8.9) 

If we assume that for liminf(K — ndb)) > 0, the number of optimal partitions obeys the 
bound Zopt < 2^" with probability at least 1 — o(n~^), so that w.h.p. for every subset 
of (n — 2) weights Xj there are at most 2^" many optimal integer partitions, the above 
argument gives that for liminf(/€ — Kc{b)) > 0, w.h.p., 

F„(/€,6) < 2-''=('')"+2=". (8.10) 

Combined with the bound ()8.13p below, this proves Remark 18.11 

Conversely, given ji,j2, with probability 1 — 0(6"*^'°^ ") there exists at least one 
optimal partition {o'j}j-^j^j2 of the subset {Xj}j^j^^j^, with discrepancy 



< M2-''-(^)"+''" (8.11) 
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by the bound ()6.64|) of Theorem 16.31 Given such an optimal subpartition (and assuming 
that e has been chosen smaller than fi;„(6)), with sufficiently high probability we can find 
an unique pair cTji,crj2 € (—1, 1) such that {cj}i<j<n is a basis solution of the full LPP, 
by solving 

Indeed, with probability 1 — 0(6""^^°^^ "), the sum on the right is bounded by M2~'^-^''^"~^'^^, 
and with probability 1 - 0(2-''-(^)"+^"), 

\Xj, - Xj^\ > M2-^-('')'^+^". (8.12) 

Thus w.h.p. there exist at least (2) basis solutions with the optimal subpartition property, 
so that 

2-^^(^)™ < F„(/€,6) (8.13) 

as long as < £ < K_(6). 

Consider finally limsup(/t — k_(6)) < 0. Then, by Theorem 15. 11 on the event Xj is 

even," w.h.p. we have 2('^=(^)^'*)"+'^("^''^ perfect partitions cr of discrepancy zero, each 
being a basis solution of the LPP with iV(cr) = 0. On the complementary event Xj 

is odd", w.h.p. there are 2('==('')-'^)"+o("'^' perfect partitions {aj}j^i^2 of {Xj}j^i^2, of 
discrepancy one, and we can find, uniquely, ai, (T2 G (—1, 1) such that 

di + as = 0, aiXi + CT2X2 = -J2 ^j^j- (8-14) 

Indeed \Xi — X2I is w.h.p. of order, say, Mn~°' at least (a > 0), and the last sum 
is in [—1,1]. Thus, regardless of the parity of J2jXj, the LPP has w.h.p. at least 
2(Kc(6)-K)n+o(7ii/2 logn) i^g^gjg solutions with the optimal subpartition property. 

On the other hand, the total number of the optimal subpartition basis solutions cr is 
at most 2(«-(^)-«)"+o('^'^'i°g") with probability l-0(n2e-''^°g'"). Indeed, for N{cr) = 0, ct 
is a perfect partition, and for N{(t) = 2, the ±l-valued aj form an optimal, hence perfect 
partition of the corresponding weights Xj. In either case, the number of corresponding 
perfect partitions is asymptotic to 2(''-(^)"'')"+o("'/'i°g"), and F„(fc,6) = 2-''"+o(v^iog'^). 

Finally, if we assume that the bound ()5.52j) holds with probability at least 1 — o(n~^), 
whenever b < be and limsup(«: — K,c{b)) < 0, then the above arguments immediately give 
that, w.h.p., Fn{k,b) = 2-'''^+°('^). ■ 

9 Open Problems and Numerical Experiments 
9.1 Open Problems 

Many problems are left open in our analysis. Most important is the question of how 
we characterize the phase transition from the perfect phase to the hard phase. In the 



Constrained Integer Partitions (DRAFT, February 24, 2003) 



53 



unconstrained case [3] , our theorems would have allowed us to give at least three equivalent 
definitions. 

First, in the unconstrained problem, we defined the phase transition to be the point 
K = Kc = 1 &t which the probability of a perfect partition decreases abruptly from 1 to 
0. For the constrained case considered in this paper, we have proved only that such a 
transition occurs somewhere within the interval Kc{b)). In particular, we have not 

proved that such a transition is sharp. 

Second, in the constrained case, we could have characterized the phase transition as the 
point up to which the expected number of perfect partitions remains exponential, which 
again would have led to the value k = Kc = 1 0- In contrast, in the unconstrained case, 
here we have shown that the expected number of perfect partitions remains exponential 
until some k = K,e{b) > i^c{b), because up to k = Ke(&), with vanishingly small probability, 
there are very many perfect partitions. Thus we cannot use the second definition in the 
constrained case. Alternatively, one could ask whether the typical (say median) number 
of perfect partitions changes from exponentially large to at k = K,c{b), a definition that 
would have given the same transition point Kc = 1 iii. the unconstrained case, and that 
might also work here. 

A third possible definition of the transition point is the point above which there is 
an unique optimal partition, a definition which would have again led to = 1 in the 
unconstrained case |2J- Here we cannot prove uniqueness until we are in the sorted phase 
(6 > be), far above k, = Kc{b). 

A natural conjecture would be that at least the first and third definitions coincide and 
that both lead to a sharp transition along the line k = Kc{b). 

Assuming we could establish a sharp transition, we could then examine some other 
open problems. In the unconstrained problem, we were able to determine the finite-size 
scaling window around the transition point e. the region in which the probability 

of a perfect partition has nontrivial distribution. In jSj, we showed that the window has 
width of order n~^, and is centered at Kc + Q{n~^ logn) . In the constrained case, as 
discussed in Remark 12.41 the fiuctuations in the number of perfect partitions should be 
large enough to lead to a nontrivial probability distribution within a window of width of 
order n~^/^ about k^. Hence we expect the width of the window to be larger here than 
it was in the unconstrained case, at least of order Our numerical experiments, 

reviewed in the next subsection, support this expectation. 

Next, detailed estimates in the unconstrained case proved that the k smallest discrep- 
ancies (i.e., for a given set of n random integers, the k smallest absolute values of the 
difference in the sums of the integers in the two subsets) have a Poisson joint distribution 
PJ. This is the behavior that would be observed if the discrepancies of 2"~^ partitions 
(with 0"! fixed) were independent random variables. This result confirmed the validity 
of the so-called Random Energy Model or REM approximation for the continuous case, 
proposed earlier by Mertens ^Hj- We have no analogous estimates for the constrained 
problem. 

Finally, for both the constrained and unconstrained problems, it would be very use- 
ful to have theorems which establish the relevance of our phase transition results (and 
associated results like the Poisson joint distribution) to the complexity of the number 
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partitioning problem, and to tlie performance of widely used algorithms for the problem. 
In particular, is the perfect phase easy, i.e., in this phase, is it possible to find some perfect 
partition in polynomial time? Is the so-called hard phase actually computationally diffi- 
cult (under the usual assumptions, e.g., P 7^ NP)? What are the changes in the behavior 
of commonly used algorithms at k = Kc(&)? Is an artifact of our proofs or does it 

reflect some change in the complexity of the problem? For example, is there a change in 
the (admittedly non-rigorous, but often very instructive) replica solution at k = /t_(6)? 
Do commonly used algorithms experience any slowdown across this curve? And finally, 
can our LPP results be extended to establish genuine average-case complexity results for 
a class of partitioning algorithms? 

9.2 Numerics 

In this subsection, we present some simulations which address the question of sharpness 
of the transition and finite-size scaling. We begin with a brief discussion of methods, then 
go on to finite-size effects, since this is important for the interpretation of the results. 
Most of our simulations concern the number of perfect partitions in the crescent-shaped 
region from k = K_{b) to k = Kc{b)- 

9.2.1 Methods and Accessible System Sizes 

The general experimental setup is this: generate a random instance, i.e. n random integers 
Xj, uniformly drawn from the interval [1, M] and calculate the optimum discrepancy and 
the number of optimal solutions of the corresponding constrained partitioning problem. 
Loop over many such instances to get the empirical mean and the empirical standard 
deviation of quantities like the logarithm of the number of optimal partitions. 

The numbers Xj are constructed from the output of pseudorandom number generators; 
we use LCG64, a 64 bit linear congruential random number generator from the TRNG 
collection [2j. It is known that the least significant bits of linear congruential recurrences 
are correlated, so it is dangerous to use the elements of such sequences directly as a random 
instance. Instead, we use a random number for each bit in Xj. The bits are set to 1 or 
with probability |, depending on the most significant bits of the corresponding random 
number, thereby minimizing the infiuence of hidden correlations in the pseudorandom 
sequence. On the other hand, this restricts us to values of M that are integer powers of 
2. 

We can afford these extra calls to the random number generator, since the generation 
of the random instances is not the part of the simulation that limits the accessible system 
sizes. The hard part of the simulations is the solution of the particular instance. (After 
all, we are dealing with an NP-hard problem.) Even in the perfect phase, where a smart 
heuristic algorithm might find one of the exponentially many perfect partitions quickly, 
we want to find them all, so there is no obvious way to avoid an exhaustive enumeration 
of all partitions. The corresponding 0(2") time complexity is the limiting factor for our 
accessible system sizes. 

Horowitz and Sahni presented an algorithm that solves the unconstrained integer 
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partitioning problem in time 0{n-2^), and tliis algoritlim can easily be modified to count 
all perfect partitions for the constrained problem within the same time complexity. The 
Horowitz-Sahni algorithm achieves its prodigious speed-up by dividing the set of numbers 
Xj in two halves and tabulating all 2? possible discrepancies of the two half-sized sets. 
Each discrepancy of the original problem can then be represented by the sum or the 
difference of two elements, one from each table. The tables are sorted (this is the origin 
of the complexity 0{n-2^)), and due to the monotonicity of the sorted tables, the perfect 
partitions of the original problem can be found with only one single scan through both 
lists. 

The drawback of the Horowitz-Sahni algorithm is that it trades time against space. 
For K = 1, the Xj^s are n-bit numbers, hence the tables require 2n2t bits of computer 
memory. Equipped with 512 MByte of main memory, this means n < 50. For a single 
instance of size n = 50, the optimum discrepancies for all (discrete) values of b can be 
found in about 4 minutes on a Pentium III CPU with 800 Mhz clock rate. Averages 
over 10'^ random instances can be calculated in less than half an hour using 156 CPUs 
of a Beowulf cluster pp. Counting all perfect partitions in the perfect phase can take 
considerably longer if there are many such partitions, i.e. for small values of k. 

9.2.2 Concentration and Finite-Size Effects 

For our first numerical experiment, we study the finite-size effects of the central quantity 
in the perfect phase, namely the number Z of perfect partitions for given b and n. 

Optimistically extending the formula (j5.1|) - (j5.6|) f Theorem 15. 1|) . we expect that for 
large n 

Ef-logZ) ^ L(C, 77) - K log 2 - - log (irnVdet r) - —TtR-^K (9.1) 
\n / n \ /An 

is a valid approximation for all k < Hc{b)- It is illuminating to check accuracy of this 
approximation numerically, even in absence of an error term bound, better than o(n~^), 
which is implicit in ()9.1|) . 

Figure |3] shows the results of the simulation. For each data point we generated 10'^ 
instances of n random nn-hii integers and counted the number Z of perfect partitions using 
the Horowitz-Sahni algorithm. Each symbol in Figured denotes the empirical average over 
the 10^ values of n~^ logZ, and the error bars indicate ±1 empirical standard deviation. 

For n < 50, finite-size effects are clearly visible. On the other hand, the error bars 
decrease with increasing tt,, indicating a concentration of n~^\ogZ around its expected 
value. Note that the size of the error bars does not decrease if the number of random 
samples increases. This indicates that the error bars are a measure of the inherent fluctu- 
ations in n~^ logZ. Note also that for 6 > 0, the statistical fluctuations are much larger 
than in the 6 = case, consistent with our rigorous results in Theorem iBl and Remark |2.4[ 

The most surprising observation is that Eq. 19. H is a very good approximation for finite 
n > 30. Note that the "finite-size corrections" in Eq. 19. H i.e. the 0(n~^ logn)-term and 
0(?7.~^)-term, are essential for a good approximation: even for n = 50, the measured 
values of n~^'E{log Z) are about 20% below the predicted asymptotic values. 
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9.2.3 Sharpness and Location of the Perfect to Hard Transition 

The major open problem with the phase diagram is the behavior of the system inside the 
crescent-shaped region between k = and k = K,c{b)- From Theorems 15.11 and I6.1L 

we know that w.h.p. there are exponentially many perfect partitions for n < k_ and no 
perfect partitions for k, > k^.- What happens to the number of perfect partitions between 
K = and K = Is there a sharp transition from "no" to "exponentially many" 
perfect partitions, like in the unconstrained case or the case 6 = 0? If so, where does it 
occur? 

For all numerical experiments shown in this section, we have chosen b = 0.25 because 
here the gap between /t_ = 0.674 . . . and Kc = 0.799 ... is relatively large. Setting b = 0.25 
means that n must be a multiple of 8. 

In the first experiment, we determine Pperfect, the fraction of randomly generated 
instances that have a perfect partition. According to Theorems 15.11 and 16.21 as n ^ oo, 
this fraction should tend to 1 for k, < k,_ and to for k, > ■ Figure El shows the results 
for n = 32, 40, 48. For these finite n, the decay of Pperfect from 1 to extends over an 
interval larger than the crescent-shaped region, but the values outside this region seem to 
converge to their limits 1 and as n gets larger. 

To see more clearly what is happening with Pperfect, we define^ 7t by Pporfect(^) = | 
and plot Pperfect versus the rescaled control parameter An = (/t — K)n^/^ for various values 
of n. By definition, all these curves intersect at Ak = 0, but the simulation shows that 
the curves coincide for all values of An (Figure P). This data collapse indicates that 
Pperfect is uot a fuuctiou of the two parameters k and n, but of one single parameter 
Ak = (k - 7€)n^/^ 

Pperfect («:,n) = f{{K-Ky/^). (9.2) 

Validity of this scaling hypothesis in particular would imply that the transition Pperfect = 1 
to Pperfect = becomes sharp as n — cxd and that the width of the transition region scales 
like 0{n^^^^). The transition point K itself depends on n but seems to converge to Kc as 
n ^ oo (see Figure IE)). Our simulation results support the conclusion that the probability 
of a perfect partition shows a sharp transition at Kc'- 

hm Pperfect = I J (9.3) 

So far, the crescent-shaped region is not visible in the simulations, but this may change 
if we look at other quantities like the number of perfect partitions for k, < Hc- 

Figure [7| shows the result of a simulation with n = 40. The mean value of log 2' is 
very well approximated by a piecewise linear function 

_i ^, ,N f (k — K)log2 K<k ,^ 

n i-E(logZ(/€)) = I ^ ^ ^ (9.4) 

where k is a fit parameter that depends on n. Eq. 19. 41 is a very accurate description of the 
numerical data even for rather small values of n. The value of n infiuences only the size 

^Of course, for finite n, there is in general no solution k to the equation Pperfect ('*) = ^- Instead, we 
take K„ to be the closest linear interpolation of solutions with parameters n and M = 2""". 
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of the fluctuations of ■ log Z{k,) around its expected value and the value of k. As n 
becomes larger, the fluctuations decrease and k increases, moving beyond k_ and towards 
Kc (see Figure IHI). Note that Theorem 15. II implies that lim„^oo'^~^ logZ is linear function 
of K, for K < K_. 

Of course, the numerical data on k and k do not allow us to conclude that both values 
will converge to as n ^ oo, but it is obvious that both values increase with increasing 
n and are larger than /t_ . Again we can use Theorem 15.11 to estimate the finite-size 
corrections to Kc, 

L^C^ log (nvry/d^) Tri^-^iT ,^ , 

l^c\n) = — ^ — ; ; . 9.5 

^ ^ log2 ralog2 4ralog2 ^ ^ 

Figure IHI shows that both K and k are close to the finite-size estimate of k^. 

In all our simulations, we did not find any trace of a critical line k_ . The properties 
of the system do change for values below Kc, but above k_, but there is some evidence 
that this is a finite-size effect and that for really large systems, it is only Kc that matters. 
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Figure 4: Comparison of eq. 19.11 (lines) for 6 = (top) and b = 0.25 (bottom) with 
simulations. Symbols denote averages over 10'^ random samples of uniform numbers X, 
errorbars indicate ±1 standard deviation. 
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Figure 5: Probability of liaving a perfect partition in a random instance of the constrained 
partitioning problem with b = 0.25. Symbols are empirical probabilities found in 10^ 
random samples. The shaded area is the crescent-shaped region from to Kc- 




Figure 6: Same data as in Fig.|S[ but this time plotted versus the scaled control parameter 
Ak = (k — 'K)n^/'^ . The data collapse indicates a sharp transition as n ^ oo and a width 
0(n^^/^) of the scaling window. 
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Figure 7: Logarithm of the number of perfect partitions in random instances of the 
constrained partitioning problem with b = 0.25. Symbols are averages over 10^ random 
samples, errorbars indicate ±1 standard deviation. Inset: Magnification of the "critical" 
region, the vertical line is 
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Figure 8: Values of k that indicate a transition from the perfect to the hard phase. 



